nserrino on Hacker News

1

Low-Latency Inference with Speculative Decoding on D-Matrix Corsair and GPU

(gimletlabs.ai)

1 point by nserrino 2mo ago | 0 comments

2

The emerging role of SRAM-centric chips in AI inference

(gimletlabs.ai)

3 points by nserrino 2mo ago | 0 comments

3

Speeding up PyTorch inference on Apple devices with AI-generated Metal kernels

(gimletlabs.ai)

187 points by nserrino 8mo ago | 30 comments

4

Show HN: Pixie, open source observability for Kubernetes using eBPF

(github.com)

6 points by nserrino 3y ago | 3 comments

5

Dumpster diving the Go garbage collector

(blog.px.dev)

13 points by nserrino 4y ago | 0 comments

6

Observing HTTP/2 Traffic Is Hard, but eBPF Can Help

(blog.px.dev)

91 points by nserrino 4y ago | 4 comments

7

Did I get owned by Log4Shell?

(blog.px.dev)

3 points by nserrino 4y ago | 0 comments

8

Distributed Bpftrace with Pixie

(blog.px.dev)

4 points by nserrino 4y ago | 0 comments

9

Horizontal Pod Autoscaling with Custom Metrics in Kubernetes

(blog.px.dev)

4 points by nserrino 4y ago | 0 comments

10

Open sourcing Pixie under Apache 2.0 license

(blog.px.dev)

108 points by nserrino 5y ago | 18 comments

11

Want to Debug Latency? (2018)

(rakyll.medium.com)

2 points by nserrino 5y ago | 0 comments

12

A thorough introduction to eBPF (2017)

(lwn.net)

3 points by nserrino 5y ago | 0 comments

13

How Etcd works and tips to keep in mind

(blog.pixielabs.ai)

7 points by nserrino 5y ago | 0 comments
