robertnishihara on Hacker News

1

Major upgrades to Ray Serve: 88% lower latency and 11.1x higher throughput

(anyscale.com)

2 points by robertnishihara 2mo ago | 1 comment

2

SkyRL brings Tinker to your GPUs (2025)

(novasky-ai.notion.site)

24 points by robertnishihara 3mo ago | 5 comments

3

vLLM large scale serving: DeepSeek 2.2k tok/s/h200 with wide-ep

(blog.vllm.ai)

147 points by robertnishihara 4mo ago | 54 comments

4

Massively Parallel Agentic Simulations with Ray

(anyscale.com)

2 points by robertnishihara 8mo ago | 0 comments

5

Deploy DeepSeek‑R1 with VLLM and Ray Serve on Kubernetes

(anyscale.com)

1 point by robertnishihara 9mo ago | 0 comments

6

An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM

(anyscale.com)

1 point by robertnishihara 9mo ago | 0 comments

7

Native LLM APIs in Ray Data and Ray Serve

(anyscale.com)

2 points by robertnishihara 10mo ago | 0 comments

8

Joins and Hash-Shuffle in Ray Data

(anyscale.com)

3 points by robertnishihara 10mo ago | 0 comments

9

AsyncFlow: An Asynchronous Streaming RL Framework for LLM Post-Training

(arxiv.org)

4 points by robertnishihara 10mo ago | 0 comments

10

Open Source RL Libraries for LLMs

(anyscale.com)

1 point by robertnishihara 10mo ago | 0 comments

11

Large-Scale Deployment of Ray in Tencent's Weixin AI Infrastructure

(anyscale.com)

2 points by robertnishihara 10mo ago | 0 comments

12

Uv and Ray: Pain-Free Python Dependencies in Clusters

(anyscale.com)

44 points by robertnishihara 11mo ago | 10 comments

13

Roll: Reinforcement Learning Optimization for Large-Scale Learning

(github.com)

1 point by robertnishihara 11mo ago | 0 comments

14

An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM

(anyscale.com)

1 point by robertnishihara 11mo ago | 0 comments

15

Uv and Ray: Pain-Free Python Dependencies in Clusters

(anyscale.com)

1 point by robertnishihara 1y ago | 0 comments
