Going by CUDA core counts and clock speed alone, the 5090 just has way more CUDA cores and uses proportionally more power than the 4090.
All of the "massive gains" came from comparing DLSS and other optimization strategies against standard hardware rendering.
Something tells me Nvidia made next to no gains for this generation.
https://www.youtube.com/watch?v=ghT7G_9xyDU
We do see power requirements climb on the high-end parts every generation, but that may be to maintain the desired SKU price points. There are clearly some major perf/watt improvements if you zoom out. I don't know how much is arch vs node, but they have plenty of room to dissipate more power over bigger dies if needed for the high end.
The Intel problem was that their foundries couldn't keep improving the process while the other foundries kept improving theirs. But technically Nvidia can switch foundry if another one proves better than TSMC, even though that doesn't seem likely (at least without a major breakthrough that ASML hasn't already capitalized on).
None of this is magic. None of it is even particularly hard. There's no reason for any of it to get stuck. (Intel's problem was letting the beancounters delay EUV - no reason to expect there to be a similar mis-step from Nvidia.)
> Something tells me Nvidia made next to no gains for this generation.
Sounds to me like they made "massive gains". In the end, what matters to gamers is:
1. Do my games look good?
2. Do my games run well?
If I can go from 45 FPS to 120 FPS and the quality is still there, I don't care if it's because of frame generation and neural upscaling and so on. I'm not going to be upset that it's not lovingly rasterized pixel by pixel if I'm getting the same results (or better, in some cases) from DLSS.
To say that Nvidia made no gains this generation makes no sense when they've apparently figured out how to deliver better results to users for less money.
I use DLSS-type tech, but you lose a lot of fine detail with it. Faraway text looks blurry, textures aren't as rich, and the lines between individual models lose their sharpness.
Also, if you’re spending $2000 for a toy you are allowed to have high standards.
Making individual frames and benchmark numbers look better at the cost of a worse gameplay experience is an old tradition for these GPU makers.
- Data Center: Third-quarter revenue was a record $30.8 billion
- Gaming and AI PC: Third-quarter Gaming revenue was $3.3 billion
If the gains are for only 10% of your customers, I would put this closer to the "next to no gains" rather than the "massive gains".
I'd like to point you to r/FuckTAA
>Do my games run well
If the internal logic still runs at sub-120 Hz and it's a twitchy game, then no.
https://www.techpowerup.com/gpu-specs/nvidia-gb202.g1072
Maybe there is an RTX 5090 Ti being held in reserve. They could potentially increase its compute by 13% and its memory bandwidth by 25% versus the 5090.
I wonder if anyone will try to solder 36Gbps GDDR7 chips onto a 5090 and then increase the memory clock manually.
It isn't being kept a secret; it's being openly discussed that they need to leverage AI for better gaming performance.
If you can use AI to go from 40 fps to 120 fps with near-identical quality, then that's still an improvement.
So the biggest benefit is PCIe 5 and the faster/more memory (credit going to Micron).
This is one of the worst generational upgrades. They’re doing it to keep profits in the data center business.
I can understand lack of supply, but why can't I go on nvidia.com and buy something the same way I go on apple.com and buy hardware?
I'm looking for GPUs and navigating all these different resellers with wildly different prices and confusing names (on top of the already confusing set of available cards).
1. Many people knew the new series of nvidia cards was about to be announced, and nobody wanted to get stuck with a big stock of previous-generation cards. So most reputable retailers are just sold out.
2. With lots of places sold out, some scalpers have realised they can charge big markups. Places like Amazon and Ebay don't mind if marketplace sellers charge $3000 for a $1500-list-price GPU.
3. For various reasons, although nvidia makes and sells some "Founders Edition" cards, the vast majority of cards are made by other companies. Sometimes they'll do 'added value' things like adding RGB LEDs and factory overclocking, leading to a 10% price spread among cards with the same chip.
4. nvidia's product lineup is just very confusing. Several product lines (consumer, workstation, data centre) times several product generations (Turing, Ampere, Ada Lovelace) times several vram/performance mixes (24GB, 16GB, 12GB, 8GB) plus variants (Super, Ti) times desktop and laptop versions. That's a lot of different models!
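Multiplying those categories out shows the scale of the problem (a purely illustrative count; the category sizes below are assumptions, not nvidia's actual catalog):

```python
# Illustrative combinatorics of the lineup: product lines x generations
# x memory tiers x variants (base/Super/Ti) x desktop-vs-laptop.
lines, gens, memory_tiers, variants, form_factors = 3, 3, 4, 3, 2
print(lines * gens * memory_tiers * variants * form_factors, "possible models")  # 216
```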
nvidia also don't particularly want it to be easy for you to compare performance across product classes or generations. Workstation and server cards don't even have a list price, you can only get them by buying a workstation or server from an approved vendor.
Also nvidia don't tend to update their marketing material when products are surpassed, so if you look up their flagship from three generations ago it'll still say it offers unsurpassed performance for the most demanding, cutting-edge applications.
https://www.techpowerup.com/gpu-specs/rtx-6000-ada-generatio...
It's just not their main business model, it's been that way for many many years at this point. I'm guessing business people have decided that it's not worth it.
Saying that they are "resellers" isn't technically accurate. The 5080 you buy from ASUS will be different from the one you buy from MSI.
Most people don't realize that Nvidia is much more of a software company than a hardware company. CUDA in particular is like 90% of the reason why they are where they are while AMD and Intel struggle to keep up.
Also aren't most of the business cards made by Nvidia directly... or at least Nvidia branded?
> it's not worth it.
I wonder how much "it's not worth it". Surely it would have been at least somewhat profitable? (An honest question.)
So scalpers want to make a buck on that. That's all there is to it. Whenever demand surpasses supply, someone will try to make money off the difference. Unfortunately for consumers, that means scalpers use bots to clean out retail stores and then flip the cards to consumers.
You can? Thought this thread was about how they're sold out everywhere.
So nvidia wouldn't have the connections or skillset to do budget manufacturing of the low-cost boards the GPU sits on the way ASUS or EVGA does. Plus, with so many competitors angling to use the same nvidia GPU chips, nvidia collects all the margin regardless.
There is profit in this, but it’s also a whole set of skills that doesn’t really make sense for Nvidia.
3090 - 350W
3090 Ti - 450W
4090 - 450W
5090 - 575W
3x3090 (1050W) is less than 2x5090 (1150W), plus you get 72GB of VRAM instead of 64GB, if you can find a motherboard that supports 3 massive cards or good enough risers (apparently near impossible?).
I'm not buying GPUs that expensive nor energy consuming, no chance.
In any case, I think Maxwell/Pascal-level efficiency won't be seen again; with those RT cores you get more energy draw, and you can't get around that.
I built a gaming PC aiming to last 8-10 years. I spent $$$ on a MO-RA3 radiator for the water cooling loop.
My view:
1. a gaming PC is almost always plugged into a wall powerpoint
2. loudest voices in the market always want "MOAR POWA!!!"
1. + 2. = gaming PC will evolve until it takes up the max wattage a powerpoint can deliver.
For the future: "split system aircon" built into your gaming PC.
As for consumers: they don't care.
PCIe Gen 4 dictates tighter signalling tolerances to achieve a faster bus speed, and it took quite a while for good-quality Gen 4 risers to come to market. I have zero doubt in my mind that Gen 5 tightens that up even further, making the product design just that much harder.
But NVIDIA is claiming that the 5070 is equivalent to the 4090, so maybe they’re expecting you to wait a generation and get the lower card if you care about TDP? Although I suspect that equivalence only applies to gaming; probably for ML you’d still need the higher-tier card.
[1] https://nvidianews.nvidia.com/news/nvidia-puts-grace-blackwe...
[2] https://www.nvidia.com/en-us/geforce/graphics-cards/50-serie...
But it seems more aimed at inference from what I’ve read?
But the advantage is that you can load a much more complex model easily (32GB vs 24GB matters, since 24GB is only just barely enough for a 70B-parameter model).
More power consumption means more heat in my room; you can't escape thermodynamics. I have a small home office, just 6 square meters, and during summer the energy draw in my room makes a gigantic difference in temperature.
I have no intention of drawing more than 400W total while gaming, and I'd rather compromise by lowering settings.
Energy consumption can't keep increasing over and over forever.
I could even understand it on flagships, which are meant for enthusiasts, but all the tiers have been ballooning in energy consumption.
(with the suggested 1000 W PSU for the current gen, it's quite conceivable that at this rate of increase soon we'll run into the maximum of around 1600 W from a typical 110 V outlet on a 15 A circuit)
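For what it's worth, the back-of-the-envelope outlet math (assuming the common 80% continuous-load rule on North American circuits) makes that ceiling even tighter:

```python
# Outlet power budget on a typical North American 110 V / 15 A circuit.
volts, amps = 110, 15
peak_w = volts * amps        # 1650 W absolute maximum
continuous_w = peak_w * 0.8  # ~1320 W under the common 80% continuous-load rule
print(f"peak: {peak_w} W, sustained: ~{continuous_w:.0f} W")
```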
I do a lot of training of encoders, multimodal models, and vision models, which are typically small enough to fit on a single GPU; multiple GPUs enable data parallelism, where each GPU holds an independent copy of the model and the data is split between them.
Occasionally fine-tuning large models and need to use model-parallelism, where the model is split across GPUs. This is also necessary for inference of the really big models, as well.
But most tooling for training/inference of all kinds of models supports using multiple cards pretty easily.
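As a rough illustration, single-node data parallelism in PyTorch can be as simple as the sketch below (the model and batch here are made up for the example; `DistributedDataParallel` is what you'd reach for in serious multi-GPU runs):

```python
# Minimal data-parallel training step in PyTorch: one model replica per
# visible GPU, with each batch split across them automatically.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(512, 256), nn.ReLU(), nn.Linear(256, 10))
model = nn.DataParallel(model).cuda()   # replicates the model onto every GPU

opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
x = torch.randn(256, 512).cuda()        # this batch gets sharded across GPUs
y = torch.randint(0, 10, (256,)).cuda()

loss = nn.functional.cross_entropy(model(x), y)
opt.zero_grad()
loss.backward()
opt.step()
```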
32 GB VRAM on the highest end GPU seems almost small after running LLMs with 128 GB RAM on the M3 Max, but the speed will most likely more than make up for it. I do wonder when we’ll see bigger jumps in VRAM though, now that the need for running multiple AI models at once seems like a realistic use case (their tech explainers also mentions they already do this for games).
Conversely, this means you can pay less if you need less.
Seems like a win all around.
For more of the fast VRAM you would be in Quadro territory.
And yet it just so happens they work effectively the same. I've done research on an RTX 2070 with just 8 GB VRAM. That card consistently met or came close to the performance of a V100, albeit with less VRAM.
Why suggest people shouldn't use consumer cards? It's dramatically (like 10x-50x) cheaper. Is machine learning only for those who can afford a $10k-$50k workstation GPU? That's lame and frankly comes across as gatekeeping.
Honestly, I can't really imagine how a person could reasonably hold this stance. Just let folks buy hardware and use it however they want. Sure, it may be less than optimal, but it's important to remember that not everyone in the world can afford an H100.
Perhaps you can explain some other, better reason why people shouldn't use consumer cards for ML? It's frankly kind of a rude suggestion in the absence of a better explanation.
These are monumentally different. You can't really use your computer as an LLM machine; it's more of a novelty.
I'm not even sure why people mention these things. It's possible, but no one actually does it outside of testing purposes.
It falsely equates Nvidia GPUs with Apple CPUs. The winner is Apple.
However, I'm an AAA gamedev CTO, and they might have been telling me what the card means to me.
It won't stop crypto and LLM peeps from buying everything (one assumes TDP is proportional too). Gamers not being able to find an affordable option is still a problem.
I used to think about this often because I had a side hobby of building and selling computers for friends and coworkers who wanted to get into gaming but otherwise had no use for a powerful computer.
For the longest time I could still put together $800-$1000 PCs that could blow consoles away and provide great value for the money.
Nowadays I almost want to recommend they go back to console gaming. Seeing older PS5s on store shelves hit $349.99 during the holidays really cemented that idea. A PC build is so astronomically expensive at the moment unless you can be convinced to buy a gaming laptop on a deep sale.
Only way to fix this is for AMD to decide it likes money. I'm not holding my breath.
It probably serves to make the 4070 look reasonably priced, even though it isn't.
The 4090 was already priced for high-income people (in first-world countries). Nvidia saw 4090s being sold on the second-hand market way beyond $2k. They're merely milking the cow.
We'll have to see how much they charge for these cards this time, but I feel like the price bump has been massively exaggerated by people on HN.
https://www.okdo.com/wp-content/uploads/2023/03/jetson-agx-o...
The 3090 Ti had about 5 times the memory bandwidth and 5 times the compute capability. If that ratio holds for Blackwell, the 5090 will run circles around it when it has enough VRAM (or you have enough 5090 cards to fit everything into VRAM).
This will make it possible to run models up to 405B parameters, like Llama 3.1 405B at a 4-bit quant or Grok-1 314B at a 6-bit quant.
Who knows, maybe better models will be released in the future that are better optimized and won't need that much RAM, but it is easier to buy a second 'Digits' than to build a rack with 8x GPUs. For example, looking at the latest Llama models, Meta states: "Llama 3.3 70B approaches the performance of Llama 3.1 405B".
To run inference on Llama 3.3 70B Instruct with ~8k context length (without offloading), you'd need:
- Q4 (~44GB): 2x 5090; 1x 'Digits'
- Q6 (~58GB): 2x 5090; 1x 'Digits'
- Q8 (~74GB): 3x 5090; 1x 'Digits'
- FP16 (~144GB): 5x 5090; 2x 'Digits'
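Those card counts fall out of simple weight-size arithmetic; a rough sketch (the bits-per-weight and overhead figures below are assumptions, and real runtimes add their own overhead):

```python
# Rough VRAM estimate: weights (params x bits / 8) plus a few GB for
# KV cache and runtime overhead, then ceil-divide by 32 GB per 5090.
import math

def vram_gb(params_b: float, bits_per_weight: float, overhead_gb: float = 4.0) -> float:
    return params_b * bits_per_weight / 8 + overhead_gb

for name, bits in [("Q4", 4.5), ("Q6", 6.2), ("Q8", 8.0), ("FP16", 16.0)]:
    need = vram_gb(70, bits)
    print(f"{name}: ~{need:.0f} GB -> {math.ceil(need / 32)}x 5090")
```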
Let's wait and see which bandwidth it will have.
5070, 5070 Ti, 5080, 5090 to
5000, 5000 Plus, 5000 Pro, 5000 Pro Max.
:O
Presumably the pro hardware based on the same silicon will have 64GB, they usually double whatever the gaming cards have.
While they've come a long way, I'd imagine they're still highly specialized compared to general-purpose hardware, and maybe still graphics-oriented in many ways. One could test this by comparing them to SGI-style NUMA machines, Tilera's tile-based systems, or Adapteva's 1024-core design. Maybe Ambric, given it aimed for generality, though the Am2045's were DSP-style. They might still be GPUs if, side by side with such architectures, they still looked more like GPUs.
Press the power button, boot the GPU?
Surely a terrible idea, and I know system-on-a-chip makes this more confusing/complicated (like Apple Silicon, etc.)
Any modern card under $1000 is more than enough for graphics in virtually all games. The gaming crisis is not in a graphics card market at all.
Now, I do agree that $1000 is plenty for 95% of gamers, but for those who want the best, Nvidia is pretty clearly holding out intentionally. The gap between a 4080 Ti and a 4090 is GIANT. Check this great comparison from Tom's Hardware: https://cdn.mos.cms.futurecdn.net/BAGV2GBMHHE4gkb7ZzTxwK-120...
The biggest leap to the next offering up on the chart is the 4090.
I barely play video games but I definitely do
I disagree. I run a 4070 Super and a Ryzen 7700 with DDR5, and I still can't run Assetto Corsa Competizione in VR at 90 fps. MSFS 2024 runs at thirty-something fps at medium settings. VR gaming is a different beast.
Me. I do. I *love* raytracing; and, as has been said and seen with several of the newest AAA games, raytracing is no longer optional. It's required now. Those 1080s, wonderful as they have been (and they have been truly great cards), are definitely in need of an upgrade.
I went from 80 FPS (highest settings) to 365 FPS (capped to my Alienware 360 Hz monitor) when I upgraded from my old rig (i7-8700K and GTX 1070) to a new one (7800X3D and RTX 3090).
You will love the RTX 5080 then. It is priced at $999.
https://www.nvidia.com/en-us/geforce/graphics-cards/50-serie...
When was the last time Nvidia made a high end GeForce card use only 2 slots?
(Looks like Nvidia even advertises an "SFF-Ready" label for cards that are small enough: https://www.nvidia.com/en-us/geforce/news/small-form-factor-...)
Translation: No significant actual upgrade.
Sounds like we're continuing the trend of newer generations being beaten on fps/$ by the previous generations while hardly pushing the envelope at the top end.
A 3090 is $1000 right now.
Jensen thinks that "Moore's Law is Dead" and it's just time to rest and vest with regards to GPUs. This is the same attitude that Intel adopted 2013-2024.
I've heard this twice today so curious why it's being mentioned so often.
Not really worth it if you can get a 5090 for $1,999
2x faster in DLSS. If we look at the 1:1 resolution performance, the increase is likely 1.2x.
Seemingly NVIDIA is just playing number games: wow, 3352 is a huge leap compared to 1321, right? But how does it really help us with LLMs, diffusion models, and so on?
> DLPerf (Deep Learning Performance) - is our own scoring function. It is an approximate estimate of performance for typical deep learning tasks. Currently, DLPerf predicts performance well in terms of iters/second for a few common tasks such as training ResNet50 CNNs. For example, on these tasks, a V100 instance with a DLPerf score of 21 is roughly ~2x faster than a 1080Ti with a DLPerf of 10. [...] Although far from perfect, DLPerf is more useful for predicting performance than TFLops for most tasks.
The real jump is 26%, at 28% higher power draw and 25% higher price.
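The power and price figures check out from the published specs (assuming the 4090 at 450 W / $1,599 and the 5090 at 575 W / $1,999):

```python
# Gen-over-gen ratio check, 4090 -> 5090 (spec values assumed above).
power_delta = 575 / 450 - 1    # ~0.28 -> 28% more power
price_delta = 1999 / 1599 - 1  # ~0.25 -> 25% more money
print(f"power: +{power_delta:.0%}, price: +{price_delta:.0%}")
```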
A dud indeed.
Does anyone know what these might cost in the US after the rumored tariffs?
If DLSS4 and “MOAR POWAH” are the only things on offer versus my 3090, it’s a hard pass. I need efficiency, not a bigger TDP.
Going from 60 to 120 fps is cool. Going from 120 fps to 240 fps is in the realm of diminishing returns, especially because the added latency makes it a non-starter for fast-paced multiplayer games.
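The frame-time arithmetic makes the diminishing returns concrete:

```python
# Each doubling of frame rate saves half as many milliseconds as the last.
for lo, hi in [(60, 120), (120, 240)]:
    saved_ms = 1000 / lo - 1000 / hi
    print(f"{lo} -> {hi} fps saves {saved_ms:.1f} ms per frame")
# 60 -> 120 saves 8.3 ms; 120 -> 240 saves only 4.2 ms
```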
12GB VRAM for over $500 is an absolute travesty. Even today, cards with 12GB struggle in some games. 16GB is fine right now, but I'm pretty certain it will be an issue in a few years, and it's kind of insane at $1000. The amount of VRAM should really be double what it is across the board.
Likely 10-30%, going off both the CUDA core specs (nearly unchanged gen-over-gen for everything but the 5090) and the two benchmarks Nvidia published that didn't use DLSS 4 multi-frame generation: Far Cry 6 and A Plague Tale.
https://www.nvidia.com/en-us/geforce/graphics-cards/50-serie...
I'm expecting a minor bump that will look less impressive if you compare it to watts; these things are hungry.
It's hard to get excited when most of the gains will be limited to a few new showcase AAA releases, and maybe an update to a couple of your favourites if you're lucky.
You can try out pretty much all GPUs on a cloud provider these days. Do it.
VRAM is important for maxing out your batch size. It might make your training go faster, but other hardware matters too.
How much extra VRAM speeds things up also depends on your training code. If your next batch isn't ready by the time the current one has finished training, fix that first.
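A quick way to check for that bottleneck is to time how long the loop blocks waiting on the data loader versus how long the actual training step takes (a minimal PyTorch sketch; the dataset and model are made up just to show the measurement):

```python
# Time spent waiting on the DataLoader vs. time spent in the training step.
import time
import torch
from torch.utils.data import DataLoader, TensorDataset

data = TensorDataset(torch.randn(10_000, 128), torch.randint(0, 10, (10_000,)))
loader = DataLoader(data, batch_size=256, num_workers=4, pin_memory=True)

model = torch.nn.Linear(128, 10).cuda()
opt = torch.optim.SGD(model.parameters(), lr=0.01)

wait_s = step_s = 0.0
t0 = time.perf_counter()
for x, y in loader:
    t1 = time.perf_counter()
    wait_s += t1 - t0                 # blocked waiting on data loading
    loss = torch.nn.functional.cross_entropy(model(x.cuda()), y.cuda())
    opt.zero_grad()
    loss.backward()
    opt.step()
    torch.cuda.synchronize()          # so step time reflects real GPU work
    t0 = time.perf_counter()
    step_s += t0 - t1

print(f"waiting on data: {wait_s:.2f}s, training: {step_s:.2f}s")
```

If the waiting number dominates, more VRAM won't help; more DataLoader workers or faster storage will.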
Coil whine is noticeable on my machine. I can hear when the model is training/next batch is loading.
Don't bother with the Founders Edition.
> Don't bother with the Founders Edition.
Why?
That's not even close, the M4 Max 12C has less than a third of the 5090s memory throughput and the 10C version has less than a quarter. The M4 Ultra should trade blows with the 4090 but it'll still fall well short of the 5090.
By the way, this is even better as far as memory size is concerned:
https://www.asrockrack.com/minisite/AmpereAltraFamily/
However, memory bandwidth is what matters for token generation. The memory bandwidth of this is only 204.8GB/sec if I understand correctly. Apple's top level hardware reportedly does 800GB/sec.
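As a rule of thumb, memory-bound token generation tops out near bandwidth divided by the bytes read per token (roughly the model's size in memory), so the gap is easy to estimate (illustrative numbers, ignoring batching and compute overlap):

```python
# Upper-bound tokens/sec ~= memory bandwidth / model size in memory.
def max_tokens_per_sec(bandwidth_gb_s: float, model_gb: float) -> float:
    return bandwidth_gb_s / model_gb

for name, bw in [("Ampere Altra (~205 GB/s)", 204.8),
                 ("Apple top-end (~800 GB/s)", 800.0)]:
    print(f"{name}: ~{max_tokens_per_sec(bw, 40):.0f} tok/s on a ~40 GB model")
```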
If that holds up in the benchmarks, this is a nice jump for a generation. I agree with others that more memory would've been nice, but it's clear Nvidia are trying to segment their SKUs into AI and non-AI models and using RAM to do it.
That might not be such a bad outcome if it means gamers can actually buy GPUs without them being instantly bought by robots like the peak crypto mining era.
I would expect something like a 5080 Super with 20/24GB of VRAM. 16GB just seems wrong for their "target" consumer GPU.
This time around, I will save for the 5090 or just wait for the Ti/Super refreshes.
* Neural texture stuff - also super exciting, big advancement in rendering, I see this being used a lot (and helps to make up for the meh vram blackwell has)
* Neural material stuff - might be neat, Unreal strata materials will like this, but going to be a while until it gets a good amount of adoption
* Neural shader stuff in general - who knows, we'll see how it pans out
* DLSS upscaling/denoising improvements (all GPUs) - Great! More stable upscaling and denoising is very much welcome
* DLSS framegen and reflex improvements - bleh, ok I guess, reflex especially is going to be very niche
* Hardware itself - lower end a lot cheaper than I expected! Memory bandwidth and VRAM is meh, but the perf itself seems good, newer cores, better SER, good stuff for the most part!
Note that the material/texture/BVH/denoising stuff is all from research papers Nvidia and others have put out over the last few years, finally getting productionized. Neural textures and Nanite-like RT are things I've been hyped about for the past ~2 years.
I'm very tempted to upgrade my 3080 (that I bought used for $600 ~2 years ago) to a 5070 ti.
I'm hoping generative AI models can be used to generate more immersive NPCs.
- Memory: RTX 5090: 32 GB GDDR7, ~1.8 TB/s bandwidth. H100 (SXM5): 80 GB HBM3, ~3+ TB/s bandwidth.
- Compute: RTX 5090: ~318 TFLOPS in ray tracing, ~3,352 AI TOPS. H100: optimized for matrix and tensor computations, with ~1,000 TFLOPS for AI workloads (using Tensor Cores).
- Power: RTX 5090: 575W, higher for enthusiast-class performance. H100 (PCIe): 350W, efficient for data centers.
- Price: RTX 5090: expected MSRP ~$2,000 (consumer pricing). H100: starts at ~$15,000–$30,000+ per unit.
They're not really supposed to be, either, judging by how they priced this. For non-AI uses the 5080 is infinitely better positioned.
Source: https://www.nvidia.com/en-us/data-center/technologies/blackw...
It seems obvious to me that even NVIDIA knows that 5090s and 4090s are used more for AI Workloads than gaming. In my company, every PC has 2 4090s, and 48GB is not enough. 64GB is much better, though I would have preferred if NVIDIA went all in and gave us a 48GB GPU, so that we could have 96GB workstations at this price point without having to spend 6k on an A6000.
Overall I think 5090 is a good addition to the quick experimentation for deep learning market, where all serious training and inference will occur on cloud GPU clusters, but we can still do some experimentation on local compute with the 5090.
I always end up late to the party, and the prices end up being massively inflated; even now I can't seem to buy a 4090 for anywhere close to the RRP.
The cat loves laying/basking on it when it's putting out 1400W with the GPUs in 400W mode (plus 200W for the CPU), so I leave it turned up most of the time!
It's easy to get carried away with VRAM size, but keep in mind that most people with Apple Silicon (who can enjoy several times more memory) are stuck at inference, while training performance is off the charts on CUDA hardware.
The jury is still out on actual AI training performance, but I bet a 4090, if sold at $1k or below, would be better value than the lower-tier 50 series. The "AI TOPS" of the 50 series is only impressive for the top model, while the rest are either similar or have lower memory bandwidth despite the newer architecture.
I think by now training is best left to the cloud, and at this rate I'd rather own a 5070 Ti overall.
Gaming performance has plateaued for some time now; maybe an 8K monitor wave can revive things.
I miss when high-end GPUs were $300-400, and you could get something reasonable for $100-200. I guess that's just integrated graphics these days.
The most I've ever spent on a GPU is ~$300, and I don't really see that changing anytime soon, so it'll be a long time before I'll even consider one of these cards.
That time was 25 years ago, though; I think the GeForce DDR was the last high-end card to fit this price bracket. While cards have gotten a lot more expensive, those $300 high-end cards would be around $600 in today's money. And $200-400 for low-end still exists.
It doesn't matter if that's through software or hardware improvements.
This is the same thing they did with the RTX 4000 series. More fake frames, less GPU horsepower, "Moore's Law is Dead", Jensen wrings his hands: "Nothing I can do! Moore's Law is Dead!" It's the same way Intel has been slacking since 2013.
They may resurrect it at some stage, but at this stage yes.
I'm planning to upgrade (probably to a mid-range card) as my 5-year-old computer is starting to show its age, and with the new GPUs releasing this might be a good time.