> but the 80GB A100s server GPUs definitely are
I'm sure LLMs were considered, like many other ML use cases, but that the A100 was designed specifically for LLMs? I'm not so sure about that.
The A100 was released the same year as GPT-3, and it wasn't until GPT-3 went live that people really started paying attention. And designing and producing a GPU surely takes longer than a couple of months.