The four variants are the Effective 2B (E2B) and Effective 4B (E4B) edge models, designed to run on-device on phones, Raspberry Pi, and Jetson Nano hardware developed in collaboration with the Pixel team, Qualcomm, and MediaTek.
Google announces Gemma 4 open AI models, switches to Apache 2.0 license
The two large Gemma variants, 26B Mixture of Experts and 31B Dense, are designed to run unquantized in bfloat16 format on a single 80GB Nvidia H100 GPU.