DiffusionGemma promises 1,000 tok/s but reaches only 43 tok/s on a Mac. Real benchmarks show why autoregressive Gemma wins on Apple Silicon.
I Ran Google's 1,000-Tokens-Per-Second Model…