Jensson
1y ago
We know exactly why: floating-point operations aren't associative, but the GPU scheduler assumes they are, and the scheduler isn't deterministic. Running the model strictly (with a fixed order of operations) hurts performance, so they don't do that.
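To make the non-associativity concrete, here is a minimal CPU-side sketch in plain Python. It is not GPU code and does not model any real scheduler; the helper `sum_in_order` and the sample values are made up for illustration. It only shows that adding the same numbers in a different order can give a different result.

```python
def sum_in_order(values, order):
    """Add values one at a time, following the given index order.

    Hypothetical helper: 'order' stands in for whatever order a
    scheduler happens to pick for a reduction.
    """
    total = 0.0
    for i in order:
        total += values[i]
    return total


# Same three numbers, two groupings, two different answers.
left = (0.1 + 0.2) + 0.3
right = 0.1 + (0.2 + 0.3)
print(left == right)  # False

# A more dramatic case: the small term is lost when it is added
# to the large one first, but survives when the large terms cancel first.
values = [1e16, 1.0, -1e16]
lost = sum_in_order(values, [0, 1, 2])   # (1e16 + 1.0) + -1e16
kept = sum_in_order(values, [0, 2, 1])   # (1e16 + -1e16) + 1.0
print(lost, kept)
```

If the order of a parallel reduction can change between runs, differences like these can show up in the output unless a fixed order is enforced, which is the performance trade-off mentioned above.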
fzzzy
1y ago
Cool, thanks a lot for the explanation. Makes sense.