1Accelerating LLM Inference with Parallel Draft Models (PARD) (opens in new tab)(amd.com)1dhruvdh11mo ago0
2Open-sourcing Three EXAONE 3.5 Models: 2.4B, 7.8B, 32B (opens in new tab)(lgresearch.ai)13dhruvdh1y ago4