Skip to content
Better HN
Fastgen – SOTA LLM inference in 3k lines of Python | Better HN