Read original ↗
newsReddit r/LocalLLaMATrust 58 · CommunityPublished 7d agoLive · 7d ago

Gemma 4 12b needs glasses

Having a lot of fun using Gemma 4 as an assistant, but is growing frustrated with the poor default image resolution setting for image vision. Tasks like identifying smaller text in an image that Qwen 3.6 flies through, Gemma 4 are never able to decipher. Even larger overall elements of composition it consistently fails at. I tried adding some param to LlamaCpp that supposedly worked with Gemma 4 31b: --image-min-tokens 560 --