Intel Announces Inference-Optimized Xe3P Graphics Card with 160GB VRAM

(phoronix.com)

209 points · wrigby · 7mo ago · 129 comments

mft_ 7mo ago

I have no idea of the likely price, but (IMO) this is the sort of disruption that Intel needs to aim at if it's going to make some sort of dent in this market. If they could release this for around the price of a 5090, it would be very interesting.

Aurornis 7mo ago

> If they could release this for around the price of a 5090

This is not targeted at consumers. It’s competing with nVidia’s high RAM workstation cards. Think $10K price range, not $1-2K.

The 160GB of LPDDR5X chips alone is expensive enough that they couldn’t release this at the $2K price point unless they felt like giving it away (which they don’t)

nullsmack 7mo ago

They better watch the price because you can get a 128GB AMD Strix Halo mini pc for ~1700-2000 today, and those will be even cheaper a year from now. If they're trying to be competitive then it really needs to be more in that ballpark than the massively overpriced Nvidia range.

Sweepi 7mo ago

In the back of my head floats $200 - $300 for the 64 GiB GDDR7 that you get for spending 7k-10k on an ADA 6000 96 GiB instead of 2k for a 5090 32 GiB. Am I off? Is LPDDR5X more expensive?

Sweepi 6mo ago

Note for myself: each 3 GiB GDDR7 IC costs $10-$15, x32 sums up to $320-$480, https://www.techpowerup.com/337853/samsung-3-gb-gddr7-chips-...
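The arithmetic in that note can be checked with a quick sketch. The chip count and per-chip price range are the figures quoted in the comment, not verified prices, and `memory_bom_usd` is a made-up helper name.

```python
def memory_bom_usd(chips: int, low_usd: float, high_usd: float) -> tuple[float, float]:
    """Return the (low, high) total cost in USD for `chips` memory ICs."""
    return chips * low_usd, chips * high_usd

# Figures from the comment: 32 chips of 3 GiB each at $10-$15 per chip.
chips = 32
capacity_gib = chips * 3
low, high = memory_bom_usd(chips, 10, 15)
print(f"{capacity_gib} GiB of GDDR7: ${low:.0f}-${high:.0f}")
```

This covers the memory ICs only; it says nothing about the board, the GPU die, or margin.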

conradev 7mo ago

The GDDR7 found next to most Blackwell chips is more expensive than the LPDDR5X next to this or an M-series/Strix Halo chip.

minraws 7mo ago

It's probably DGX Spark competition, so around $3-5k would be ideal.

Any higher and it's not really a disruption.

musicale 7mo ago

Intel made a dent in the consumer gaming market with Battlemage.

They made a dent in the HPC market / Top500 with intel MAX.

It will be interesting to see if they can make a dent in the AI inference market (presumably datacenter/enterprise).

killingtime74 7mo ago

Not sure where you're getting that from, Battlemage has been considered a failure. https://finance.yahoo.com/news/report-indicates-intels-lates...

schmorptron 7mo ago

Maybe not that low, but given it's using LPDDR5 instead of GDDR7, at least the ram should be a lot cheaper.

Neywiny 7mo ago

Certainly an interesting choice. Dramatically worse performance but dramatically larger capacity. Only time will tell how it actually goes.

magicalhippo 7mo ago

With 160GB, surely they can add more channels to compensate?

schmorptron 7mo ago

Rumor has it (according to MLID, so no one knows whether it's accurate) that AMD is also looking to use regular LPDDR memory for some of its lower-end next-gen GPUs to not have to contend with Nvidia over limited and cartelled GDDR7 supply. Maybe they're going to increase parallel bandwidth to compensate? Or have wholly different tricks up their sleeve.

Tepix 7mo ago

It's LPDDR5X

wtallis 7mo ago

LPDDR5x really just means LPDDR5 running at higher than the original speed of 6400MT/s. Absent any information about which faster speed they'll be using, this correction doesn't add anything to the discussion. Nobody would expect even Intel to use 6400MT/s for a product that far in the future. Where they'll land on the spectrum from 8533 MT/s to 10700 MT/s is just a matter for speculation at the moment.

baq 7mo ago

With this much ram don’t expect anything remotely affordable by civilians.

kjellsbells 7mo ago

Uncle Sam owns a good chunk of Intel now. "Not affordable by civilians" might be precisely the target market: the DoD/national intelligence agencies have money to burn, can fund things long enough to stabilize Intel a little, and in exchange they get first dibs on everything.

Intel for intel on your Intels, perhaps.

wmf 7mo ago

160 GB LPDDR5 is ~$1,200 retail so the card could be sold for $2,000. The price will depend on how desperate Intel is. Intel probably can't copy Nvidia's pricing.

Aurornis 7mo ago

> 160 GB LPDDR5 is ~$1,200 retail so the card could be sold for $2,000.

Prices are set by what the market will bear, not the lowest possible price where they could break even on the BOM and manufacturing costs.

The high cost of the LPDDR5X should be a clue that this is going to be in the $10K range, not the $2K range.

baq 7mo ago

It’d be a disaster for Intel if it sold for less than 3k. Personally I think they’re aiming for break-even at 5k a pop at least, and I wouldn’t be surprised if they advertised 2x the memory at half Nvidia’s price, which would put it at ~15-20k, with a healthy margin, which they need like oxygen now. Of course it’s all for naught if it doesn’t perform compute-wise.

dragonwriter 7mo ago

I mean, even without that, the phrase “enterprise GPU” does not tend to convey “priced for typical consumers”.

schmorptron 7mo ago

Xe3P, as far as I remember, is built in their own fabs, as opposed to Xe3 at TSMC. This could give them a huge advantage by being possibly the only competitor not competing for the same TSMC wafers.

makapuf 7mo ago

Funny they still call them graphics cards when they're really... I don't know, matmul cards? Tensor cards? TPU? Well that sums it up maybe, what those are are really CUDA cards.

wmf 7mo ago

This sounds like a gaming card with extra RAM so it's kind of appropriate to call it a graphics card.

mikkupikku 7mo ago

FPUs?

musicale 7mo ago

> what those are are really CUDA cards

That don't run CUDA?

halJordan 7mo ago

Dude, this is asinine. Graphics cards have been doing matrix and vector operations since they were invented. No one had a problem with calling matrix multipliers graphics cards until it became cool to hate AI.

adastra22 7mo ago

It was many generations before vector operations were moved onto graphics chips.

pjmlp 7mo ago

Only for those not following history of graphics chips.

https://en.wikipedia.org/wiki/TMS34010

> The TMS34010 can execute general purpose programs and is supported by an ANSI C compiler.

> The successor to the TMS34010, the TMS34020 (1988), provides several enhancements including an interface for a special graphics floating point coprocessor, the TMS34082 (1989). The primary function of the TMS34082 is to allow the TMS340 architecture to generate high quality three-dimensional graphics. The performance level of 60 million vertices per second was advanced at the time.

Like these, there were several others over the IBM PC's own history.

shwaj 7mo ago

I think they’re using “vector” in the linear algebra sense, e.g. multiplying a matrix and a vector produces a different vector.

Not, as I assume you mean, vector graphics like SVG, and renderers like Skia.

boomskats 7mo ago

If you s/graphics/3d graphics does that still hold true?

yjftsjthsd-h 7mo ago

GPUs may well have done the same-ish operations for a long time, but they were doing those operations for graphics. GPGPU didn't take off until relatively recently.

roenxi 7mo ago

Graphics cards haven't ever done graphics. Graphics is a screen thing. Nobody looks at their graphics card to see little pictures. So they are still misnamed, but they've always been misnamed. They do BLAS.

knowitnone3 7mo ago

Any business people here that can explain why companies announce products a year before their release? I can understand getting consumers excited but it also tells competitors what you are doing giving them time to make changes of their own. What's the advantage here?

jsnell 7mo ago

In this case there is no risk of anyone stealing Intel's ideas or even reacting to them.

First, they're not even an also-ran in the AI compute space. Nobody is looking to them for roadmap ideas. Intel does not have any credibility, and no customer is going to be going to Nvidia and demanding that they match Intel.

Second, what exactly would the competitors react to? The only concrete technical detail is that the cards will hopefully launch in 2027 and have 160GB of memory.

The cost of doing this is really low, and the value of potentially getting into the pipeline of people looking to buy data center GPUs in 2027 soon enough to matter is high.

baq 7mo ago

Given how long it takes to develop a new GPU I’m pretty sure this one was signed off by Pat and given it survived Lip-Bu’s axe that says something, at least for Intel.

AnthonyMouse 7mo ago

If customers know your product exists before they can buy it then they may wait for it. If they buy the competitor's product today because they don't know your product will exist until the day they can buy it then you lose the sale.

Samples of new products also have to go out to third party developers and reviewers ahead of time so that third party support is ready for launch day and that stuff is going to leak to competitors anyway so there's little point in not making it public.

fragmede 7mo ago

If you're Intel sized, it's gonna leak. If you announce it first, you get to control the message.

The other thing is enterprise sales is ridiculously slow. If Intel wants corporate customers to buy these things, they've got to announce them ~a year ahead, in order for those customers to buy them next year when they upgrade hardware.

Perenti 7mo ago

It can also prevent competitors from entering a particular space. I was told as an undergraduate that UNIX was irrelevant because the upcoming Windows NT would be POSIX compliant. It took a _very_ long time before that happened (and for a very flexible version of "compliant"), but the pointy-headed bosses thought that buying Microsoft was the future. And at first glance the upcoming NT _looked_ as if the TCO would be much lower than AIX, HP-UX or Solaris.

Then of course Linux took over everywhere except the desktop.

AnthonyMouse 7mo ago

That wasn't even necessarily false. Windows NT on commodity hardware from the likes of Dell arguably did have a lower TCO than proprietary UNIX on proprietary hardware.

But then Linux on that same commodity hardware was lower yet.

epolanski 7mo ago

I don't think you're giving much advantage to anybody really on such a small timeframe.

Semiconductors are like container ships, they are extremely slow and hard to steer, you plan today the products you'll release in 2030.

pointyfence 7mo ago

It's more than a year. They're sampling this to customers in the second half of 2026. It's a 2027 launch at best.

Intel has practically nothing to show for an AI capex boom for the ages. I suspect that Intel is talking about it early for a shred of AI relevance.

reactordev 7mo ago

This is a shareholder “me too” product

thenaturalist 7mo ago

What are they gonna do with their own fab?

Not release anything?

There'll be a good market share for comparatively "lower power / good enough" local AI. Check out Alex Ziskind's analysis of the B50 Pro [0]. Intel has an entire line-up of cheap GPUs that perform admirably for local use cases.

This guy is building a rack on B580s and the driver update alone has pushed his rig from 30 t/s to 90 t/s. [1]

0: https://www.youtube.com/watch?v=KBbJy-jhsAA

1: https://old.reddit.com/r/LocalLLaMA/comments/1o1k5rc/new_int...

reactordev 7mo ago

Watson…

Yeah, even RTXs are limited in this space due to lack of tensor cores. It’s a race to integrate more cores and faster memory buses. My suspicion is this is more of a me-too product announcement so they can play partner to their business opportunities and continue greasing their wheels.

toast0 7mo ago

Adding on to everyone else. It might help with sales for those with long procurement cycles.

If you're planning a supercomputer to be built in 2027, you want to look at what's on the roadmap.

Mars008 7mo ago

To keep investors happy and the stock from falling? Fairy tales work as well, see Tesla robots.

teeray 7mo ago

> What's the advantage here?

Stock number go up

creaturemachine 7mo ago

The AI bubble might not last another year. Better get a few more pumps in before it blows.

Mars008 7mo ago

AI is not going anywhere. Now everyone wants to get a piece. Local inference is expected to grow: document, image, video, etc. processing. Another obvious one is driverless farm vehicles and other automated equipment. "Assisted" books, images, news... are already here and growing fast. Translation is also a fact.

thenaturalist 7mo ago

The technology, maybe - and if on local.

The public co valuations of quickly depreciating chip hoarders selling expensive fever dreams to enterprises are gonna pop though.

Spending 3-7 USD for 20 cents in return and 95% project failure rates for quarters on end aren't gonna go unnoticed on Wall St.

baq 7mo ago

There is a serious possibility this isn’t a bubble. Too many people watched the big short and now call every bull a bubble; maybe the bubble was the dollar and it’s popping now instead.

thenaturalist 7mo ago

Have you looked in detail at the economics of this?

Career finance professionals are calling it a bubble, not due to their suddenly found deep technological expertise, but because public cos like FAANG et al. are engaging in typical bubble-like behavior: shifting capex away from their balance sheets into SPACs co-financed by private equity.

This is not a consumer debt bubble, it's gonna be a private market bubble.

But as all bubbles go, someone's gonna be left holding the bag with society covering for the fallout.

It'll be a rate hike, it'll be some Fortune X00 enterprises cutting their non-ROI-AI-bleed, or it'll be an AI fanboy like Oracle over-leveraging themselves and then watching their credit default swaps going "Boom!", leading to a financing cut-off.

cwillu 7mo ago

Any discussion of an intel entry to discrete graphics cards needs to at least _mention_ intel's repeated history of abandoning discrete graphics cards.

kobalsky 7mo ago

the GPU market is not what it used to be, it's not some checkbox some executive needs to check to say "we are doing something".

the chips are so valuable now NVIDIA will end up owning a chunk of every major tech company, everyone is throwing cash and shares at them as fast as they can.

hnuser123456 7mo ago

At least Larrabee's cancellation resulted in the Offset engine going to the Firefall (2014) devs, which was a really great F2P MMO game for a while.

sharts 7mo ago

You’re saying it’s like the Google of graphics cards?

cwillu 7mo ago

Very much so.

bigmattystyles 7mo ago

I remember the Larrabee and Xeon Phi announcements and getting so excited at the time. So I'll wait but curb my enthusiasm.

Analemma_ 7mo ago

Yeah, Intel's problem is that this is (at least) the third time they've announced a new ML accelerator platform, and the first two got shitcanned. At this point I wouldn't even glance at an Intel product in this space until it had been on the market for at least five years and several iterations, to be somewhat sure it isn't going to be killed, and Intel's current leadership inspires no confidence that they'll wait that long for success.

wmf 7mo ago

Xe works much much better than Larrabee or Xeon Phi ever did. Xe3 might even be good.

throwaway173738 7mo ago

I’m personally just thinking about how they treated their embedded Keem Bay line. Totally shitcanned without warning. I doubt they consider this a core market to the degree that they will endure bad sales numbers for a while.

eadwu 7mo ago

It'll be either "cheap" like the DGX Spark (with crap memory bandwidth) or overpriced with the bus width of an M4 Max with the rhetoric of Intel's 50% margin.

phonon 7mo ago

Or it will be cheap, with the ability to expand 8X on a server. Particularly with PCIe 6.0 coming soon, might be a very attractive package.

https://www.linkedin.com/posts/storagereview_storagereview-a...

tonetegeatinst 7mo ago

What price is this sitting at? Because if its software support is decent then Intel might have just managed to break into the hardware for AI on the edge. Examples like self hosted LLM finetuning and RAG on an old dell or HP server with these type of cards on them.

Aurornis 7mo ago

> Examples like self hosted LLM finetuning and RAG on an old dell or HP server with these type of cards on them.

This won’t be in the price range of an old Dell server or a fun impulse buy for a hobbyist. 160GB of raw LPDDR5X chips alone is not cheap.

This is a server/workstation grade card and the price is going where the market will allow. Consider that an nVidia card with almost half the RAM is going to cost $8K or more. That price point is probably the starting point for where this will be priced, too.

matt-p 7mo ago

Maybe not old, but if this was, say, a 6k card, that would make it accessible to pretty much any business and at least some hobbyists. 160GB of LPDDR5 should be less than 2k, so it's easily doable if they've got the will. 4 x 5090s is 128GB and probs much more powerful at ~8k, so it would need to be 6/7k to make it make sense.

vardump 7mo ago

That nVidia card is going to have 5x the memory bandwidth. LPDDR5X is going to be rather low bandwidth.

(My guess is Intel's card is only going to have about 400 GB/s bandwidth.)

matt-p 7mo ago

Last year's M4 Max MacBook is 520GB/s and (I expect) that should be closer to 1TB/s in a year or two by the time they are using ddr5. It would be deeply embarrassing if they had worse performance than Apple's cheaper laptop.

silisili 7mo ago

Between 18A becoming viable and this, it seems Intel is finally climbing out of the hole it's been in for years.

Makes me wonder whether Gelsinger put all this in motion, or if the new CEO lit a fire under everyone. Kinda a shame if it's the former...

viraptor 7mo ago

Gelsinger had a long term realistic plan. He was out around 11 months ago. You can't magic a new GPU in that timeframe - those projects have 3+ years pipelines for CPUs. I assume GPU will be a bit shorter, but not that much.

Whatever happened with new products today must've been started before he left.

RoyTyrell 7mo ago

Will this have any support for open source libraries like PyTorch or will it be all Intel proprietary software that you need a license for?

CoastalCoder 7mo ago

Intel puts a huge priority on DL framework support before releasing related hardware, going back to at least 2017.

I assume that hasn't changed.

0xfedcafe 7mo ago

OpenVINO is entirely open-source and can run PyTorch and ONNX models, so this is definitely not a topic of concern. PyTorch also has native Intel GPU support https://docs.pytorch.org/docs/stable/notes/get_start_xpu.htm...
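For readers who want to see what that native support looks like from user code, here is a minimal sketch. It assumes a recent PyTorch build that ships the `torch.xpu` module; `pick_device` is a made-up helper name, and the code falls back to the CPU when PyTorch or an Intel GPU is missing. Nothing here is specific to the Xe3P card, which has not shipped.

```python
def pick_device() -> str:
    """Return "xpu" if PyTorch reports an Intel GPU, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        return "cpu"  # PyTorch is not installed, so there is nothing to probe
    xpu = getattr(torch, "xpu", None)  # older builds may lack torch.xpu
    if xpu is not None and xpu.is_available():
        return "xpu"
    return "cpu"

device = pick_device()
print(f"running on: {device}")
```

With PyTorch installed, the returned string can be passed to calls such as `torch.ones(4, device=device)`.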

pjmlp 7mo ago

There is PyTorch support on oneAPI.

api 7mo ago

A not-absurdly-priced card that can run big models (even quantized) would sell like crazy. Lots and lots of fast RAM is key.

bigwheels 7mo ago

How does LPDDR5 (This Xe3P) compare with GDDR7 (Nvidia's flagships) when it comes to inference performance?

Local inference is an interesting proposition because today in real life, the NV H300 and AMD MI-300 clusters are operated by OpenAI and Anthropic in batching mode, which slows users down as they're forced to wait for enough similar sized queries to arrive. For local inference, no waiting is required - so you could get potentially higher throughput.

freeqaz 7mo ago

I think the better comparison, for consumers, is how fast is LPDDR5 compared to the normal DDR5 attached to your CPU?

Or, to be more specific, what is the speed when your GPU is out of RAM and it's reading from main memory over the PCI-E bus?

PCI-E 5.0: 64GB/s @ 16x or 32GB/s @ 8x

2x 48GB (96GB) of DDR5 in an AM5 rig: ~50GB/s

Versus the ~300GB/s+ possible with a card like this, it's a lot faster for large 'dense' models. Yes, even an NVIDIA 3090 is ~900GB/s of bandwidth, but it's only 24GB, so even a card like this Xe3P is likely to 'win' because of the higher memory available.

Even if it's 1/3rd of the speed of an old NVIDIA card, it's still 6x+ the speed of what you can get in a desktop today.
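The ratios in this comment can be reproduced with a short sketch. All bandwidth values are the approximate figures quoted in the comment; the Xe3P number in particular is the commenter's guess, not a published spec, and the dictionary keys are just labels.

```python
# Approximate bandwidths quoted in the comment, in GB/s.
bandwidth_gbs = {
    "PCIe 5.0 x16": 64,
    "PCIe 5.0 x8": 32,
    "DDR5 on AM5": 50,
    "Xe3P (guess)": 300,  # commenter's estimate, not a published figure
    "RTX 3090": 900,
}

def speedup(fast: str, slow: str) -> float:
    """Ratio of two entries in the table above."""
    return bandwidth_gbs[fast] / bandwidth_gbs[slow]

print(f"card vs desktop DDR5: {speedup('Xe3P (guess)', 'DDR5 on AM5'):.1f}x")
print(f"card vs RTX 3090:     {speedup('Xe3P (guess)', 'RTX 3090'):.2f}x")
```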

MrBuddyCasino 7mo ago

This doesn’t matter at all, if the resulting tokens/sec is still too slow for interactive use.

halJordan 7mo ago

LPDDR5X (not LPDDR5) is 10.7 Gbps per pin. GDDR7 is 32 Gbps per pin. So it's going to be slower.

codedokode 7mo ago

Yes but in matrix multiplication there are O(N²) numbers and O(N³) multiplications, so it might be possible that you are bounded by compute speed.
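A rough sketch of that argument in terms of FLOPs per byte moved, assuming 2-byte values and that each matrix is read or written exactly once (an idealization; the function names are made up). The matrix-vector case is added here for contrast and is not from the comment: it is the shape that single-request token generation tends to reduce to.

```python
def matmul_intensity(n: int, bytes_per_value: int = 2) -> float:
    """FLOPs per byte for an (n x n) @ (n x n) matrix multiply."""
    flops = 2 * n**3                          # one multiply and one add per term
    bytes_moved = 3 * n**2 * bytes_per_value  # read A and B, write C, once each
    return flops / bytes_moved

def matvec_intensity(n: int, bytes_per_value: int = 2) -> float:
    """FLOPs per byte for an (n x n) matrix times a length-n vector."""
    flops = 2 * n**2
    bytes_moved = (n**2 + 2 * n) * bytes_per_value  # matrix, input and output vectors
    return flops / bytes_moved

for n in (256, 4096):
    print(n, round(matmul_intensity(n), 1), round(matvec_intensity(n), 2))
```

Matrix-matrix intensity grows linearly with `n`, which supports the compute-bound case for large batches, while matrix-vector intensity stays just under 1 FLOP per byte regardless of size.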

qingcharles 7mo ago

I asked GPT to pull real stats on both. Looks like the 50-series RAM is about 3X that of the Xe3P, but it wanted to remind me that this new Intel card is designed for data centers and is much lower power, and that the comparable Nvidia server cards (e.g. H200) have even better RAM than GDDR7, so the difference would be even higher for cloud compute.

btian 7mo ago

Isn't that precisely what DGX Spark is designed for?

How is this better?

geerlingguy 7mo ago

DGX Spark is $4000... this might (might) not be? (and with more memory)

btian 7mo ago

This starts shipping in 2027. I'm sure you can buy a DGX Spark for less than $4k in 2 years time.

lillecarl 7mo ago

I'm hopeful for the second hand market, imagine when these have paid for themselves and you can do local inference of crazy capable models!?

incomingpain 7mo ago

A year out. In that time Nvidia and AMD, not to mention Huawei and others, are going to hit the market as well. Intel is quite behind.

To me, the price point is what matters. It's going to be slow with LPDDR5X. The 5090 today is much faster. But sure, big RAM.

RTX Pro 6000 with 96GB of RAM will be much faster.

So I'm thinking the price point is below the 6000, above the 5090.

DrNosferatu 7mo ago

It would be great if they would greatly undercut the price of the NVIDIA DGX Spark.

mawadev 7mo ago

Honestly, Intel just has to build a GPU with insane amount of VRAM. It doesn't even have to be the fastest to compete... just a ton of vram for dirt cheap

tommica 7mo ago

Isn't this exactly that?

maeln 7mo ago

We don't know the pricing yet.

tommica 7mo ago

Fair... Hopefully it's consumer friendly. AI absolutely allows new companies to compete in the GPU context, but it's a surprise that no one has made an expansion card for AI usage. Computers have the PCIE slots for that purpose.

jychang 7mo ago

It’s LPDDR5X

It’s gonna be slowwww

It’s gonna be what, 273GB/sec VRAM bandwidth at most? Might as well buy an AMD 395+ 128GB right now for the same inference performance and slightly less VRAM.

kingstnap 7mo ago

Bandwidth depends very much on bus width.

If it's fast LPDDR5X (9600 MT/s) with a 512-bit bus width (8 64-bit channels (actually multiples of quad 16-bit subchannel nonsense)) it could be upwards of 600 GB/s. Lots of bandwidth like the beefy Macs have.
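The peak-bandwidth arithmetic behind that estimate, as a small sketch. The 512-bit bus is the commenter's hypothetical, not an announced spec, and the three transfer rates are the LPDDR5X speeds mentioned elsewhere in the thread; `peak_bandwidth_gbs` is a made-up helper name.

```python
def peak_bandwidth_gbs(transfer_rate_mts: float, bus_width_bits: int) -> float:
    """Theoretical peak memory bandwidth in GB/s (1 GB = 1e9 bytes)."""
    bytes_per_transfer = bus_width_bits / 8
    return transfer_rate_mts * 1e6 * bytes_per_transfer / 1e9

# Hypothetical 512-bit bus at three LPDDR5X speed grades.
for rate in (8533, 9600, 10700):
    print(f"{rate} MT/s x 512-bit: {peak_bandwidth_gbs(rate, 512):.1f} GB/s")
```

These are theoretical peaks; sustained bandwidth on real hardware is lower.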

jychang 7mo ago

1. 600GB/sec is still slow as hell. You might as well use regular DDR5 RAM then if you're so slow, you can spec regular system DDR5 RAM faster than 600GB/sec. The half decade old consumer 3090 is 1.5x that speed. The current 5090 is 1,792 GB/s. The current Nvidia datacenter cards are 8 TB/s. What's the point of having lots of VRAM if your system RAM is faster?

For context: if you have a 160GB dense ML model in VRAM and you're just running 600GB/sec, you can do... roughly 4 tokens per second AT BEST. That massive amount of VRAM is unusable if it's slow.

2. 512 bit LPDDR5x is most likely just 512GB/sec with typical LPDDR5x that's not overly expensive. I would be HIGHLY surprised if they gave it the more expensive RAM that'd break 600GB/sec. The Intel B60 is at 456 GB/s and that's using GDDR6.

Honestly, you're better off waiting for regular DDR6 to come out in a year and just build a system using that.
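The "roughly 4 tokens per second" figure in point 1 follows from a simple bound: if every weight of a dense model must be read once per generated token, decode speed cannot exceed bandwidth divided by model size. A minimal sketch under that assumption, ignoring batching, caches, and mixture-of-experts models that touch only part of the weights (`max_tokens_per_second` is a made-up name):

```python
def max_tokens_per_second(model_gb: float, bandwidth_gbs: float) -> float:
    """Upper bound on single-stream decode speed for a dense model."""
    return bandwidth_gbs / model_gb

# 160 GB dense model at the two bandwidth figures discussed in the comment.
for bw in (512, 600):
    print(f"{bw} GB/s: {max_tokens_per_second(160, bw):.2f} tokens/s")
```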

hengheng 7mo ago

How can you tell without knowing the bus width?

maeln 7mo ago

Slow is better than nothing. A card with this much VRAM in a "prosumer" price range would be really interesting right now for workstations, to work with big models.

jychang 7mo ago

Slow is worse than nothing.

What's the point of this card that's going to be released around the same time as DDR6, and DDR6 will be faster? Might as well use cheaper system RAM if your system RAM is faster than VRAM.

numpad0 7mo ago

Slow is better than unavailable

DrNosferatu 7mo ago

Does anyone have any idea about the price?

g42gregory 7mo ago

Does anybody know the memory bandwidth?

vrighter 7mo ago

does anyone still make gpus for graphics anymore?

nullsmack 7mo ago

whoa, shoot this directly into my veins

thedudeabides5 7mo ago

the mad lad leopold did it, props

Tepix 7mo ago

Sounds as if it won't be widely available before 2027, which is disappointing for a 341GB/s chip.

storus 7mo ago

Intel leadership actually reads HN? Mindblown...

j / k navigate · click thread line to collapse

129 comments

mft_7mo ago

Aurornis7mo ago

> If they could release this for around the price of a 5090

This is not targeted at consumers. It’s competing with nVidia’s high RAM workstation cards. Think $10K price range, not $1-2K.

The 160GB of LPDDR5X chips alone is expensive enough that they couldn’t release this at the $2K price point unless they felt like giving it away (which they don’t)

nullsmack7mo ago

Sweepi7mo ago

In the back of my head floats $200 - $300 for the 64 GiB GDDR7 that you get for spending 7k-10k on an ADA 6000 96 GiB instead of 2k for a 5090 32 GiB. Am I off? Is LPDDR5X more expensive?

Sweepi6mo ago

Note for myself: each 3 GiB GDDR7 IC costs $10-$15, x32 sums up to $320-$480, https://www.techpowerup.com/337853/samsung-3-gb-gddr7-chips-...

conradev7mo ago

The GDDR7 found next to most Blackwell chips is more expensive than the LPDDR5X next to this or an M-series/Strix Halo chip.

minraws7mo ago

It's probably spark dgx competition. So around 3-5k would be ideally it.

Any higher and its not really a disruption

musicale7mo ago

Intel made a dent in the consumer gaming market with Battlemage.

They made a dent in the HPC market / Top500 with intel MAX.

It will be interesting to see if they can make a dent in the AI inference market (presumably datacenter/enterprise).

killingtime747mo ago

Not sure where you're getting that from, battlemage has been considered a failure. https://finance.yahoo.com/news/report-indicates-intels-lates...

schmorptron7mo ago

Maybe not that low, but given it's using LPDDR5 instead of GDDR7, at least the ram should be a lot cheaper.

Neywiny7mo ago

Certainly an interesting choice. Dramatically worse performance but dramatically larger only time will tell how it actually goes

magicalhippo7mo ago

With 160GB, surely they can add more channels to compensate?

1 more reply

schmorptron7mo ago

1 more reply

Tepix7mo ago

It‘s LPDDR5X

wtallis7mo ago

baq7mo ago

With this much ram don’t expect anything remotely affordable by civilians.

kjellsbells7mo ago

Intel for intel on your Intels, perhaps.

wmf7mo ago

160 GB LPDDR5 is ~$1,200 retail so the card could be sold for $2,000. The price will depend on how desperate Intel is. Intel probably can't copy Nvidia's pricing.

Aurornis7mo ago

> 160 GB LPDDR5 is ~$1,200 retail so the card could be sold for $2,000.

Prices are set by what the market will bear, not the lowest possible price where they could break even on the BOM and manufacturing costs.

The high cost of the LPDDR5X should be a clue that this is going to be in the $10K range, not the $2K range.

baq7mo ago

3 more replies

dragonwriter7mo ago

I mean, even without that, the phrase “enterprise GPU”, does not tend to convey “priced for typical consumers”.

schmorptron7mo ago

Xe3P as far as I remember is built in their own fabs as opposed to xe3 at TSMC. This could give them a huge advantage by being possibly the only competitor not competing for the same TSMC wafers

makapuf7mo ago

Funny they still call them graphics cards when they're really... I dont know, matmul cards ? Tensor cards ? TPU ? Well that sums it up maybe, what those are are really CUDA cards.

wmf7mo ago

This sounds like a gaming card with extra RAM so it's kind of appropriate to call it a graphics card.

mikkupikku7mo ago

FPUs?

musicale7mo ago

> what those are are really CUDA cards

That don't run CUDA?

halJordan7mo ago

adastra227mo ago

It was many generations before vector operations were moved onto graphics chips.

pjmlp7mo ago

Only for those not following history of graphics chips.

https://en.wikipedia.org/wiki/TMS34010

> The TMS34010 can execute general purpose programs and is supported by an ANSI C compiler.

Like these, there were several others other the IBM PC own history.

shwaj7mo ago

I think they’re using “vector” in the linear algebra sense, e.g. multiplying a matrix and a vector produces a different vector.

Not, as I assume you mean, vector graphics like SVG, and renderers like Skia.

1 more reply

boomskats7mo ago

If you s/graphics/3d graphics does that still hold true?

2 more replies

yjftsjthsd-h7mo ago

GPUs may well have done the same-ish operations for a long time, but they were doing those operations for graphics. GPGPU didn't take off until relatively recently.

roenxi7mo ago

knowitnone37mo ago

jsnell7mo ago

In this case there is no risk of anyone stealing Intel's ideas or even reacting to them.

Second, what exactly would the competitors react to? The only concrete technical detail is that the cards will hopefully launch in 2027 and have 160GB of memory.

The cost of doing this is really low, and the value of potentially getting into the pipeline of people looking to buy data center GPUs in 2027 soon enough to matter is high.

baq7mo ago

Given how long it takes to develop a new GPU I’m pretty sure this one was signed off by Pat and given it survived Lip-Bu’s axe that says something, at least for Intel.

AnthonyMouse7mo ago

fragmede7mo ago

If you're Intel sized, it's gonna leak. If you announce it first, you get to control the message.

Perenti7mo ago

Then of course Linux took over everywhere except the desktop.

AnthonyMouse7mo ago

That wasn't even necessarily false. Windows NT on commodity hardware from the likes of Dell arguably did have a lower TCO than proprietary UNIX on proprietary hardware.

But then Linux on that same commodity hardware was lower yet.

epolanski7mo ago

I don't think you're giving much advantage to anybody really on such a small timeframe.

Semiconductors are like container ships, they are extremely slow and hard to steer, you plan today the products you'll release in 2030.

pointyfence7mo ago

It's more than a year. They're sampling this to customers in the second half of 2026. It's a 2027 launch at best.

Intel has practically nothing to show for an AI capex boom for the ages. I suspect that Intel is talking about it early for a shred of AI relevance.

reactordev7mo ago

This is a shareholder “me too” product

thenaturalist7mo ago

What are they gonna do with their own FAB?

Not release anything?

This guy is building a rack on B580s and the driver update alone has pushed his rig from 30 t/s to 90 t/s. [1]

0: https://www.youtube.com/watch?v=KBbJy-jhsAA

1: https://old.reddit.com/r/LocalLLaMA/comments/1o1k5rc/new_int...

reactordev7mo ago

Watson…

toast07mo ago

Adding on to everyone else. It might help with sales for those with long procurement cycles.

If you're planning a supercomputer to be built in 2027, you want to look at what's on the roadmap.

Mars0087mo ago

To keep investors happy and stock from failing? Fairy tales work as well, see Tesla robots.

teeray7mo ago

> What's the advantage here?

Stock number go up

creaturemachine7mo ago

The AI bubble might not last another year. Better get a few more pumps in before it blows.

Mars0087mo ago

thenaturalist7mo ago

The technology, maybe - and if on local.

The public co valuations of quickly depreciating chip hoarders selling expensive fever dreams to enterprises are gonna pop though.

Spend 3-7 USD for 20 cents in return and 95% project failures rates for quarters on end aren't gonna go unnoticed on Wall St.

1 more reply

baq7mo ago

There is a serious possibility this isn’t a bubble. Too many people watched the big short and now call every bull a bubble; maybe the bubble was the dollar and it’s popping now instead.

thenaturalist7mo ago

Have you looked in detail at the economics of this?

This is not a consumer debt bubble, it's gonna be a private market bubble.

But as all bubbles go, someone's gonna be left holding the bag with society covering for the fallout.

cwillu7mo ago

Any discussion of an intel entry to discrete graphics cards needs to at least _mention_ intel's repeated history of abandoning discrete graphics cards.

kobalsky7mo ago

the GPU market is not what it used to be, it's not some checkbox some executive needs to check to say "we are doing something".

the chips are so valuable now NVIDIA will end up owning a chunk of every major tech company, everyone is throwing cash and shares at them as fast as they can.

hnuser1234567mo ago

At least Larrabee's cancellation resulted in the Offset engine going to the Firefall (2014) devs, which was a really great F2P MMO game for a while.

sharts7mo ago

You’re saying it’s like the Google of graphics cards?

cwillu7mo ago

Very much so.

bigmattystyles7mo ago

I remember the Larrabee and Xeon Phi announcements and getting so excited at the time. So I'll wait but curb my enthusiasm.

Analemma_7mo ago

wmf7mo ago

Xe works much much better than Larrabee or Xeon Phi ever did. Xe3 might even be good.

throwaway1737387mo ago

eadwu7mo ago

It'll be either "cheap" like the DGX Spark (with crap memory bandwidth) or overpriced with the bus width of an M4 Max with the rhetoric of Intel's 50% margin.

phonon7mo ago

Or it will be cheap, with the ability to expand 8X on a server. Particularly with PCIe 6.0 coming soon, might be a very attractive package.

https://www.linkedin.com/posts/storagereview_storagereview-a...

tonetegeatinst7mo ago

Aurornis7mo ago

> Examples like self hosted LLM finetuning and RAG on an old dell or HP server with these type of cards on them.

This won’t be in the price range of an old Dell server or a fun impulse buy for a hobbyist. 160GB of raw LPDDR5X chips alone is not cheap.

matt-p7mo ago

vardump7mo ago

That nVidia card is going to have 5x the memory bandwidth. LPDDR5X is going to be rather low bandwidth.

(My guess is Intel's card is only going to have about 400 GB/s bandwidth.)

matt-p7mo ago

silisili7mo ago

Between 18A becoming viable and this, it seems Intel is finally climbing out of the hole it's been in for years.

Makes me wonder whether Gelsinger put all this in motion, or if the new CEO lit a fire under everyone. Kinda a shame if it's the former...

viraptor7mo ago

Whatever happened with new products today must've been started before he left.

RoyTyrell7mo ago

Will this have any support for open source libraries like PyTorch or will it be all Intel proprietary software that you need a license for?

CoastalCoder7mo ago

Intel puts a huge priority on DL framework support before releasing related hardware, going back to at least 2017.

I assume that hasn't changed.

0xfedcafe7mo ago

pjmlp7mo ago

There is PyTorch support on oneAPI.
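For anyone wanting to check this on their own machine: recent upstream PyTorch releases expose Intel GPUs as an "xpu" device, built on the oneAPI stack. A minimal sketch of device selection, assuming a PyTorch build with XPU support; `pick_device` is a made-up helper name, and the fallback order is just one reasonable choice:

```python
def pick_device() -> str:
    """Return "xpu" if PyTorch sees an Intel GPU, otherwise "cuda" or "cpu"."""
    try:
        import torch
    except ImportError:
        return "cpu"  # PyTorch is not installed, so there is nothing to probe
    # Older builds may lack the xpu module entirely, hence the hasattr guard.
    if hasattr(torch, "xpu") and torch.xpu.is_available():
        return "xpu"
    if torch.cuda.is_available():
        return "cuda"
    return "cpu"


print(pick_device())
```

The returned string can be passed straight to `tensor.to(...)` or `torch.device(...)`, so the rest of a script does not need to know which vendor's card is present.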

api7mo ago

A not-absurdly-priced card that can run big models (even quantized) would sell like crazy. Lots and lots of fast RAM is key.

bigwheels7mo ago

How does LPDDR5 (This Xe3P) compare with GDDR7 (Nvidia's flagships) when it comes to inference performance?

freeqaz7mo ago

I think the better comparison, for consumers, is how fast is LPDDR5 compared to the normal DDR5 attached to your CPU?

Or, to be more specific, what is the speed when your GPU is out of RAM and it's reading from main memory over the PCI-E bus?

PCI-E 5.0: 64GB/s @ 16x or 32GB/s @ 8x

2x 48GB (96GB) of DDR5 in an AM5 rig: ~50GB/s

Even if it's 1/3rd of the speed of an old NVIDIA card, it's still 6x+ the speed of what you can get in a desktop today.

MrBuddyCasino7mo ago

This doesn’t matter at all, if the resulting tokens/sec is still too slow for interactive use.

halJordan7mo ago

LPDDR5X (not LPDDR5) is 10.7 Gbps per pin. GDDR7 is 32 Gbps per pin. So it's going to be slower.
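Those are per-pin data rates, so the card-level number also depends on the bus width, which nobody in this thread has a confirmed figure for. A rough sketch of the arithmetic; the 256-bit and 512-bit widths below are assumptions picked only for illustration, not Xe3P specs:

```python
def peak_bandwidth_gbs(gbps_per_pin: float, bus_width_bits: int) -> float:
    """Theoretical peak bandwidth in GB/s: per-pin rate times pin count, bits to bytes."""
    return gbps_per_pin * bus_width_bits / 8


# Per-pin rates are the ones quoted above; bus widths are hypothetical.
for name, rate in (("LPDDR5X", 10.7), ("GDDR7", 32.0)):
    for width in (256, 512):
        print(f"{name} @ {width}-bit: {peak_bandwidth_gbs(rate, width):.0f} GB/s")
```

At the same bus width GDDR7 comes out roughly 3x faster, but a wide enough LPDDR5X bus can close part of that gap.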

codedokode7mo ago

Yes but in matrix multiplication there are O(N²) numbers and O(N³) multiplications, so it might be possible that you are bounded by compute speed.
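One way to put numbers on that is arithmetic intensity, the FLOPs done per byte moved. A minimal sketch, assuming 2-byte (FP16/BF16) elements and that each matrix passes through memory exactly once; `arithmetic_intensity` is an illustrative helper, not a library function:

```python
def arithmetic_intensity(m: int, k: int, n: int, bytes_per_elem: int = 2) -> float:
    """FLOPs per byte for an (m x k) @ (k x n) product, each matrix moved once."""
    flops = 2 * m * k * n  # one multiply and one add per inner-product term
    bytes_moved = bytes_per_elem * (m * k + k * n + m * n)  # read A and B, write C
    return flops / bytes_moved


n = 4096
print(f"matrix-matrix: {arithmetic_intensity(n, n, n):.1f} FLOPs/byte")
print(f"matrix-vector: {arithmetic_intensity(1, n, n):.3f} FLOPs/byte")
```

The large matrix-matrix case (big batches, prompt processing) does a lot of compute per byte and can end up compute-bound. The matrix-vector case, which is closer to generating one token at a time for a single user, does about one FLOP per byte, so memory bandwidth is more likely to be the limit there.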

qingcharles7mo ago

btian7mo ago

Isn't that precisely what DGX Spark is designed for?

How is this better?

geerlingguy7mo ago

DGX Spark is $4000... this might (might) not be? (and with more memory)

btian7mo ago

This starts shipping in 2027. I'm sure you can buy a DGX Spark for less than $4k in 2 years time.

lillecarl7mo ago

I'm hopeful for the second hand market, imagine when these have paid for themselves and you can do local inference of crazy capable models!?

incomingpain7mo ago

It's a year out. In that time Nvidia and AMD, not to mention Huawei and others, are going to hit the market as well. Intel is quite behind.

To me, the price point is what matters. It's going to be slow with ddr5. The 5090 today is much faster. But sure big ram.

RTX pro 6000 with 96gb of ram will be much faster.

So I'm thinking price point is below the 6000, above the 5090.

DrNosferatu7mo ago

It would be great if they would greatly undercut the price of the NVIDIA DGX Spark.

mawadev7mo ago

Honestly, Intel just has to build a GPU with an insane amount of VRAM. It doesn't even have to be the fastest to compete... just a ton of VRAM for dirt cheap.

tommica7mo ago

Isn't this exactly that?

maeln7mo ago

We don't know the pricing yet.

tommica7mo ago

jychang7mo ago

It’s LPDDR5x

It’s gonna be slowwww

It’s gonna be what, 273GB/sec VRAM bandwidth at most? Might as well buy an AMD 395+ 128GB right now for the same inference performance and slightly less VRAM.

kingstnap7mo ago

Bandwidth depends very much on bus width.

jychang7mo ago

For context: if you have a 160GB dense ML model in VRAM and you're just running 600GB/sec, you can do... roughly 4 tokens per second AT BEST. That massive amount of VRAM is unusable if it's slow.

Honestly, you're better off waiting for regular DDR6 to come out in a year and just building a system using that.
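The 4 tokens per second figure follows from a simple bound: for a dense model at batch size 1, every generated token has to stream all the weights from VRAM once, so bandwidth divided by model size caps the token rate. A minimal sketch; the function name is made up for illustration, and the bandwidth values are the guesses floated in this thread, not confirmed specs:

```python
def max_tokens_per_second(bandwidth_gbs: float, model_size_gb: float) -> float:
    """Upper bound on decode speed when each token reads every weight once."""
    return bandwidth_gbs / model_size_gb


model_gb = 160  # dense model filling the whole card
for bandwidth in (273, 341, 600):  # GB/s figures mentioned in the thread
    rate = max_tokens_per_second(bandwidth, model_gb)
    print(f"{bandwidth} GB/s -> at most {rate:.2f} tokens/s")
```

This ignores batching, KV-cache traffic and mixture-of-experts models, which only read a fraction of their weights per token, so it is a ceiling for the dense single-user case rather than a prediction.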

hengheng7mo ago

How can you tell without knowing the bus width?

maeln7mo ago

Slow is better than nothing. A card with this much VRAM in a "prosumer" price range would be really interesting right now for workstations, to work with big models.

jychang7mo ago

Slow is worse than nothing.

What's the point of this card that's going to be released around the same time as DDR6, when DDR6 will be faster? Might as well use cheaper system RAM if your VRAM is slower than your system RAM.

numpad07mo ago

Slow is better than unavailable

DrNosferatu7mo ago

Does anyone have any idea about the price?

g42gregory7mo ago

Does anybody know the memory bandwidth?

vrighter7mo ago

does anyone still make gpus for graphics anymore?

nullsmack7mo ago

whoa, shoot this directly into my veins

thedudeabides57mo ago

the mad lad leopold did it, props

Tepix7mo ago

Sounds as if it won't be widely available before 2027, which is disappointing for a 341GB/s chip.

storus7mo ago

Intel leadership actually reads HN? Mindblown...
