Apriel-H1: Towards Efficient Enterprise Reasoning Models
(arxiv.org)
1 point
guiriduro
3mo ago
1 comment
guiriduro
OP
3mo ago
Apriel-H1-15b-Thinker-SFT uses incremental distillation from Apriel-Nemotron-15B-Thinker, selectively replacing less critical attention layers with linear Mamba blocks to reduce computational complexity while preserving reasoning quality.
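To make the layer-swapping idea concrete, here is a minimal sketch of how such a replacement plan could be built. It is not the paper's procedure: the per-layer importance scores, the function names `plan_hybrid_layers` and `incremental_plans`, and the "replace the lowest-scoring layers first" rule are all assumptions for illustration, and the distillation step between stages is left out entirely.

```python
def plan_hybrid_layers(importance, num_to_replace):
    """Return a per-layer mixer plan: 'attention' or 'mamba'.

    `importance` is a hypothetical list of scores, one per layer,
    where a lower score means the attention layer matters less.
    """
    if not 0 <= num_to_replace <= len(importance):
        raise ValueError("num_to_replace must be between 0 and the layer count")
    # Layer indices ordered from least to most important.
    order = sorted(range(len(importance)), key=lambda i: importance[i])
    replaced = set(order[:num_to_replace])
    return ["mamba" if i in replaced else "attention"
            for i in range(len(importance))]


def incremental_plans(importance, step, total):
    """Build successive plans, swapping `step` more layers at each stage.

    In an incremental scheme, each stage would be followed by
    distillation from the teacher before swapping further layers
    (that training step is not modeled here).
    """
    return [plan_hybrid_layers(importance, k)
            for k in range(step, total + 1, step)]


if __name__ == "__main__":
    scores = [0.9, 0.1, 0.5, 0.2]  # made-up scores for a 4-layer toy model
    for stage, plan in enumerate(incremental_plans(scores, step=1, total=2), 1):
        print(stage, plan)
```

With the toy scores above, the first stage swaps only the lowest-scoring layer and the second stage swaps the two lowest, leaving the more important attention layers in place.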