kmdupree on Hacker News

1

Show HN: FKS2G – LLM-backed metrics for deciding how closely to review code (opens in new tab)

(github.com)

2kmdupree5d ago0

2

Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error (opens in new tab)

(philosophicalhacker.com)

4kmdupree28d ago0

3

Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error (opens in new tab)

(philosophicalhacker.com)

3kmdupree1mo ago0

4

SWE-bench Verified no longer measures frontier coding capabilities (opens in new tab)

(openai.com)

343kmdupree1mo ago181

5

The Half-Life of a Moat (Part 1) (opens in new tab)

(semistructured.substack.com)

1kmdupree1mo ago0

6

Thoughts about Moments in Claude Mythos System Card (opens in new tab)

(old.reddit.com)

3kmdupree1mo ago0

7

EsoBench: Learning a Novel Esolang via Iterative Execution Feedback (opens in new tab)

(caseys-evals.com)

1kmdupree1mo ago0

8

LLMs and the Russellian Inversion (opens in new tab)

(philosophicalhacker.com)

2kmdupree9mo ago0

9

The great AI coding assistant bait and switch (opens in new tab)

(leaddev.com)

3kmdupree9mo ago0

10

Scientists just developed a new AI modeled on the human brain (opens in new tab)

(livescience.com)

4kmdupree9mo ago0

11

LLMs and the Russellian Inversion (opens in new tab)

(philosophicalhacker.com)

2kmdupree9mo ago0

12

The AI Industry Is Radicalizing (opens in new tab)

(theatlantic.com)

2kmdupree10mo ago0

13

Atlassian migrated 4M Postgres databases to shrink AWS bill (opens in new tab)

(theregister.com)

8kmdupree10mo ago0

14

Libraries are under-used. LLMs make this problem worse (opens in new tab)

(makefizz.buzz)

62kmdupree11mo ago52

15

Lessons from letting AI vibe code a landing page (opens in new tab)

(martech.org)

2kmdupree11mo ago0

kmdupree

Recent submissions

Show HN: FKS2G – LLM-backed metrics for deciding how closely to review code (opens in new tab)

Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error (opens in new tab)

Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error (opens in new tab)

SWE-bench Verified no longer measures frontier coding capabilities (opens in new tab)

The Half-Life of a Moat (Part 1) (opens in new tab)

Thoughts about Moments in Claude Mythos System Card (opens in new tab)

EsoBench: Learning a Novel Esolang via Iterative Execution Feedback (opens in new tab)

LLMs and the Russellian Inversion (opens in new tab)

The great AI coding assistant bait and switch (opens in new tab)

Scientists just developed a new AI modeled on the human brain (opens in new tab)

LLMs and the Russellian Inversion (opens in new tab)

The AI Industry Is Radicalizing (opens in new tab)

Atlassian migrated 4M Postgres databases to shrink AWS bill (opens in new tab)

Libraries are under-used. LLMs make this problem worse (opens in new tab)

Lessons from letting AI vibe code a landing page (opens in new tab)

Recent submissions

Show HN: FKS2G – LLM-backed metrics for deciding how closely to review code (opens in new tab)

Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error (opens in new tab)

Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error (opens in new tab)

SWE-bench Verified no longer measures frontier coding capabilities (opens in new tab)

The Half-Life of a Moat (Part 1) (opens in new tab)

Thoughts about Moments in Claude Mythos System Card (opens in new tab)

EsoBench: Learning a Novel Esolang via Iterative Execution Feedback (opens in new tab)

LLMs and the Russellian Inversion (opens in new tab)

The great AI coding assistant bait and switch (opens in new tab)

Scientists just developed a new AI modeled on the human brain (opens in new tab)

LLMs and the Russellian Inversion (opens in new tab)

The AI Industry Is Radicalizing (opens in new tab)

Atlassian migrated 4M Postgres databases to shrink AWS bill (opens in new tab)

Libraries are under-used. LLMs make this problem worse (opens in new tab)

Lessons from letting AI vibe code a landing page (opens in new tab)