1wheel on Hacker News

1

When Models Manipulate Manifolds: The Geometry of a Counting Task (opens in new tab)

(transformer-circuits.pub)

41wheel7mo ago0

2

Advent of Code Solve Times (opens in new tab)

(roadtolarissa.com)

21wheel1y ago0

3

Anthropic raises an additional $4B from Amazon (opens in new tab)

(techcrunch.com)

41wheel1y ago0

4

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet (opens in new tab)

(transformer-circuits.pub)

1681wheel2y ago124

5

The Claude 3 Model Family: Opus, Sonnet, Haiku [pdf] (opens in new tab)

(www-cdn.anthropic.com)

331wheel2y ago3

6

Circuits Updates – January 2024 (opens in new tab)

(transformer-circuits.pub)

11wheel2y ago0

7

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of LMs (opens in new tab)

(pair-code.github.io)

21wheel2y ago0

8

Do Machine Learning Models Memorize or Generalize? (opens in new tab)

(pair.withgoogle.com)

4541wheel2y ago210

9

An interactive introduction to grokking and mechanistic interpretability (opens in new tab)

(pair.withgoogle.com)

11wheel2y ago0

10

From Confidently Incorrect Models to Humble Ensembles (opens in new tab)

(pair.withgoogle.com)

11wheel3y ago1

11

Speeding up the JS ecosystem - npm scripts (opens in new tab)

(marvinh.dev)

91wheel3y ago3

12

Searching for Unintended Biases with Saliency (opens in new tab)

(pair.withgoogle.com)

21wheel3y ago1

13

Interactive Visualizations of Federated Learning (opens in new tab)

(pair.withgoogle.com)

11wheel3y ago0

14

How Federated Learning Protects Privacy (opens in new tab)

(pair.withgoogle.com)

21wheel3y ago0

15

It’s Not Spider-Man’s Fault: Why Best Picture Winners Aren’t Hits Anymore (opens in new tab)

(roadtolarissa.com)

21wheel4y ago0

1wheel

Recent submissions

When Models Manipulate Manifolds: The Geometry of a Counting Task (opens in new tab)

Advent of Code Solve Times (opens in new tab)

Anthropic raises an additional $4B from Amazon (opens in new tab)

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet (opens in new tab)

The Claude 3 Model Family: Opus, Sonnet, Haiku [pdf] (opens in new tab)

Circuits Updates – January 2024 (opens in new tab)

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of LMs (opens in new tab)

Do Machine Learning Models Memorize or Generalize? (opens in new tab)

An interactive introduction to grokking and mechanistic interpretability (opens in new tab)

From Confidently Incorrect Models to Humble Ensembles (opens in new tab)

Speeding up the JS ecosystem - npm scripts (opens in new tab)

Searching for Unintended Biases with Saliency (opens in new tab)

Interactive Visualizations of Federated Learning (opens in new tab)

How Federated Learning Protects Privacy (opens in new tab)

It’s Not Spider-Man’s Fault: Why Best Picture Winners Aren’t Hits Anymore (opens in new tab)

Recent submissions

When Models Manipulate Manifolds: The Geometry of a Counting Task (opens in new tab)

Advent of Code Solve Times (opens in new tab)

Anthropic raises an additional $4B from Amazon (opens in new tab)

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet (opens in new tab)

The Claude 3 Model Family: Opus, Sonnet, Haiku [pdf] (opens in new tab)

Circuits Updates – January 2024 (opens in new tab)

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of LMs (opens in new tab)

Do Machine Learning Models Memorize or Generalize? (opens in new tab)

An interactive introduction to grokking and mechanistic interpretability (opens in new tab)

From Confidently Incorrect Models to Humble Ensembles (opens in new tab)

Speeding up the JS ecosystem - npm scripts (opens in new tab)

Searching for Unintended Biases with Saliency (opens in new tab)

Interactive Visualizations of Federated Learning (opens in new tab)

How Federated Learning Protects Privacy (opens in new tab)

It’s Not Spider-Man’s Fault: Why Best Picture Winners Aren’t Hits Anymore (opens in new tab)