Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
qeternity
3mo ago
0 comments
Share
PyTorch is only part of it. There is still a huge amount of CUDA that isn’t just wrapped by PyTorch and isn’t easily portable.
0 comments
default
newest
oldest
svara
3mo ago
... but not in deep learning or am I missing something important here?
qeternity
OP
3mo ago
Yes, absolutely in deep learning. Custom fused CUDA kernels everywhere.
Scene_Cast2
3mo ago
Yep. MoE, FlashAttention, or sparse retrieval architectures for example.
j
/
k
navigate · click thread line to collapse