1. Making Apple Neural Engine work in a custom inference stack (engineering.drawthings.ai) | 1 | liuliu | 1mo ago | 0
2. Show HN: Metal Quantized Attention on M5 Max (releases.drawthings.ai) | 4 | liuliu | 1mo ago | 0