1ns scale ultra-low-latency fabric over shm and MMAP for IPC (opens in new tab)(crates.io)2venkat_28114h ago0
2Show HN: Nvidia's CUDA libraries are generic and not optimized for LLM inference (opens in new tab)(github.com)1venkat_28114mo ago1