1. Mercury 2 on PinchBench: Diffusion LLM benchmarked on real OpenClaw agent tasks (inceptionlabs.ai) | 2 points | volodia | 21h ago | 0 comments
2. Mercury 2: Best-in-class speed-optimized intelligence at 1,200 tok/sec (twitter.com) | 1 point | volodia | 29d ago | 0 comments
3. Finetuning 3-bit LLMs on consumer GPUs by integrating with modular quantizers (arxiv.org) | 2 points | volodia | 2y ago | 0 comments
4. LLMTune: 4-bit finetuning of 65B LLaMA models on a single consumer GPU (github.com) | 3 points | volodia | 2y ago | 0 comments
6. Don't have a $5k MacBook to run LLaMA-65B? MiniLLM runs LLMs on GPUs in <500 LOC (github.com) | 3 points | volodia | 3y ago | 2 comments