

ag8 | Better HN

ag8

2,006 karma | Joined July 3, 2019 | 223 submissions

runrl.com

Recent submissions

1

Gourmand Syndrome

(en.wikipedia.org)

27 points by ag8 | 3mo ago | 9 comments

2

guys why does armenian completely break Claude

(twitter.com)

99 points by ag8 | 4mo ago | 65 comments

3

Sampling at negative temperature

(cavendishlabs.org)

203 points by ag8 | 4mo ago | 60 comments

4

Perfectly Replicating Coca Cola [video]

(youtube.com)

1 point by ag8 | 4mo ago | 1 comment

5

Po.ta.to

(po.ta.to)

4 points by ag8 | 6mo ago | 2 comments

6

Scaling pretraining affects RL sample efficiency

(runrl.com)

1 point by ag8 | 7mo ago | 0 comments

7

Systematically generating tests that would have caught Anthropic's top‑K bug

(theorem.dev)

2 points by ag8 | 7mo ago | 0 comments

8

Tinker

(2b4fdb18.connectionism.pages.dev)

4 points by ag8 | 7mo ago | 2 comments

9

Training Qwen to answer briefly yet intelligently using feedback control

(runrl.com)

4 points by ag8 | 8mo ago | 0 comments

10

Launch HN: RunRL (YC X25) – Reinforcement learning as a service

(runrl.com)

71 points by ag8 | 8mo ago | 22 comments

11

Generating the Funniest Joke with RL

(runrl.com)

1 point by ag8 | 1y ago | 0 comments