1Specula: A framework for finding deep bugs in system code using TLA+ (opens in new tab)(github.com)3matt_d6h ago0
2Equality Saturation for Optimizing High-Level Julia IR (opens in new tab)(dl.acm.org)1matt_d10h ago2
3UniTe: A Universal Tensor Abstraction for Capturing Spatial Relationships (opens in new tab)(dl.acm.org)1matt_d10h ago0
4Co-Design of B+-Tree Index with Emerging Zone Interfaces for Small KV Pairs (opens in new tab)(dl.acm.org)2matt_d10h ago0
5CounterPoint: Using Hardware Counters to Refute and Refine µarch Assumptions (opens in new tab)(arxiv.org)1matt_d10h ago0
6PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost (opens in new tab)(arxiv.org)1matt_d1d ago0
7SysMoBench: Evaluating AI on Formally Modeling Complex Real-World Systems (opens in new tab)(muratbuffalo.blogspot.com)5matt_d1d ago0
9Idempotent Slices with Applications to Code-Size Reduction (opens in new tab)(arxiv.org)1matt_d1d ago0
10Microsoft Rust Training Books: Beginner, advanced, expert level material (opens in new tab)(github.com)3matt_d2d ago0
11LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis (opens in new tab)(arxiv.org)3matt_d2d ago0
12Challenges and Design Issues in Finding CUDA Bugs via GPU-Native Fuzzing (opens in new tab)(arxiv.org)2matt_d2d ago0
13SEVI: Silent Data Corruption of Vector Instructions in Hyper-Scale Datacenters (opens in new tab)(dl.acm.org)1matt_d2d ago0
14CrypTorch: PyTorch-based Auto-tuning Compiler for ML w/ Multi-party Computation (opens in new tab)(github.com)2matt_d3d ago0
15SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels (opens in new tab)(arxiv.org)3matt_d5d ago0