Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
Pulze AI Evals | Better HN
Pulze AI Evals
(opens in new tab)
(github.com)
1 points
fbnbr
1y ago
1 comments
Share
1 comments
default
newest
oldest
fbnbr
OP
1y ago
Benchmark AI models on standard datasets like FinanceBench and MMLU.
j
/
k
navigate · click thread line to collapse