I don't think they rely on SRAM very much for training. https://cerebras.ai/blog/the-complete-guide-to-scale-out-on-... outlines the memory architecture, but it seems like they keep most of the parameter storage off-wafer, which is how they scale to 100s of GB of parameters with "only" 10s of GB of on-wafer SRAM.