moondistance on Hacker News

1

Qwen 3.5: small models with impressive performance (opens in new tab)

(twitter.com)

6moondistance2mo ago0

2

Language Models Are Injective and Hence Invertible (opens in new tab)

(arxiv.org)

1moondistance6mo ago2

3

Autonomous Trash Can (opens in new tab)

(twitter.com)

20moondistance10mo ago1

4

Grok 4 Benchmarks (opens in new tab)

(twitter.com)

3moondistance10mo ago0

5

Jensen Huang – Nvidia GTC 2025 Keynote (opens in new tab)

(nvidia.com)

75moondistance1y ago74

6

Exaone Deep 32B – beats DeepSeek 671B (opens in new tab)

(lgresearch.ai)

3moondistance1y ago1

7

DeepSeek-R1 at 3,872 tokens / second on a single Nvidia HGX H200 (opens in new tab)

(blogs.nvidia.com)

13moondistance1y ago1

8

ByteDance Doubao-1.5-pro matches GPT 4o benchmarks at 50x cheaper (opens in new tab)

(twitter.com)

4moondistance1y ago0

9

US-China Commission top recommendation: Manhattan project for race to AGI [pdf] (opens in new tab)

(uscc.gov)

4moondistance1y ago0

10

AI scans RNA 'dark matter' and uncovers 70k new viruses (opens in new tab)

(nature.com)

1moondistance1y ago0

11

Llama 405B 506 tokens/second on an H200 (opens in new tab)

(developer.nvidia.com)

21moondistance1y ago5

12

SenseNova 5.5 claims SOTA LLM benchmark results (opens in new tab)

(twitter.com)

2moondistance1y ago0

13

New AI Training Technique Is Drastically Faster, Says Google (opens in new tab)

(decrypt.co)

84moondistance1y ago38

14

Nvidia open source LLM Nemotron 4 340B at top of the charts [pdf] (opens in new tab)

(d1qx31qr3h6wln.cloudfront.net)

17moondistance1y ago1

15

Uncensor Any LLM with Abliteration (opens in new tab)

(huggingface.co)

4moondistance1y ago0

moondistance

Recent submissions

Qwen 3.5: small models with impressive performance (opens in new tab)

Language Models Are Injective and Hence Invertible (opens in new tab)

Autonomous Trash Can (opens in new tab)

Grok 4 Benchmarks (opens in new tab)

Jensen Huang – Nvidia GTC 2025 Keynote (opens in new tab)

Exaone Deep 32B – beats DeepSeek 671B (opens in new tab)

DeepSeek-R1 at 3,872 tokens / second on a single Nvidia HGX H200 (opens in new tab)

ByteDance Doubao-1.5-pro matches GPT 4o benchmarks at 50x cheaper (opens in new tab)

US-China Commission top recommendation: Manhattan project for race to AGI [pdf] (opens in new tab)

AI scans RNA 'dark matter' and uncovers 70k new viruses (opens in new tab)

Llama 405B 506 tokens/second on an H200 (opens in new tab)

SenseNova 5.5 claims SOTA LLM benchmark results (opens in new tab)

New AI Training Technique Is Drastically Faster, Says Google (opens in new tab)

Nvidia open source LLM Nemotron 4 340B at top of the charts [pdf] (opens in new tab)

Uncensor Any LLM with Abliteration (opens in new tab)

Recent submissions

Qwen 3.5: small models with impressive performance (opens in new tab)

Language Models Are Injective and Hence Invertible (opens in new tab)

Autonomous Trash Can (opens in new tab)

Grok 4 Benchmarks (opens in new tab)

Jensen Huang – Nvidia GTC 2025 Keynote (opens in new tab)

Exaone Deep 32B – beats DeepSeek 671B (opens in new tab)

DeepSeek-R1 at 3,872 tokens / second on a single Nvidia HGX H200 (opens in new tab)

ByteDance Doubao-1.5-pro matches GPT 4o benchmarks at 50x cheaper (opens in new tab)

US-China Commission top recommendation: Manhattan project for race to AGI [pdf] (opens in new tab)

AI scans RNA 'dark matter' and uncovers 70k new viruses (opens in new tab)

Llama 405B 506 tokens/second on an H200 (opens in new tab)

SenseNova 5.5 claims SOTA LLM benchmark results (opens in new tab)

New AI Training Technique Is Drastically Faster, Says Google (opens in new tab)

Nvidia open source LLM Nemotron 4 340B at top of the charts [pdf] (opens in new tab)

Uncensor Any LLM with Abliteration (opens in new tab)