telexed ~ c / a9eccb67-0edradar:80 · model_apiLIVE
← back
NO.
#a9eccb67
Topic
MODELS & API
Source
r/LocalLLaMA
Published
2026-05-01 12:33:04
Importance
★ 8/10 — radar 80

`gemma-4-31B-it-DFlash` released, but blocked on `llama.cpp` support

Weights are up on Hugging Face, but local testing is still blocked by unmerged llama.cpp PR #22105. Useful only for tracking right now; wait for merge before judging real usability.

[ KEY POINTS ]
  1. The model is already published at huggingface.co/z-lab/gemma-4-31B-it-DFlash, so distribution started before runtime support landed.
  2. Testing is gated by ggml-org/llama.cpp PR #22105; without that merge, local inference flow is effectively blocked.
  3. This is a release you bookmark, not deploy today. The next real checkpoint is PR merge, then compatibility and performance checks.
Originalwww.reddit.com/r/LocalLLaMA/comments/1t0s4qv/gemma431bitdflash_has_been_released/Read original →

// related