Skip to content

Top New Best Ask Show Jobs

Batched reward model inference and Best-of-N sampling | Better HN

Batched reward model inference and Best-of-N sampling (opens in new tab)

(raw.sh)

34 pointsrawsh1y ago0 comments

0 comments

No comments yet.