Google logo
Google
Open Weights

Gemma 4 E2B (Reasoning)

Released Apr 2026

Intelligence
#280
Coding
#300
Context128K
Parameters5.1B

Gemma 4 E2B is a multimodal edge language model developed by Google DeepMind and released in April 2026. Part of the Gemma 4 family, this model is specifically engineered for high-efficiency on-device deployment, offering native support for text, images, and audio. It is built using the same research foundations as the Gemini 3 proprietary models and is released under a permissive Apache 2.0 license, granting full commercial freedom for developers.

The model introduces an architectural innovation known as Per-Layer Embeddings (PLE). While the model weights total 5.1 billion parameters including large embedding tables, it utilizes only 2.3 billion effective parameters during inference. This design significantly reduces the RAM and battery consumption required for local execution on mobile and IoT devices. The architecture also incorporates a Shared KV Cache and a hybrid attention mechanism that interleaves local sliding window attention with global attention layers to optimize performance and memory footprint.

A primary feature of the E2B variant is its specialized reasoning capabilities. All models in the Gemma 4 family include a configurable "thinking" mode that allows the model to generate internal reasoning traces before producing a final answer. This capability enables higher performance on complex logic, mathematics, and agentic workflows. Additionally, the model natively supports system prompts and multimodal function calling, allowing it to interact with external tools and APIs directly from the edge.

Gemma 4 E2B supports a context window of 128,000 tokens, enabling the processing of long-form documents and large data repositories locally. Its native audio processing allows for automatic speech recognition (ASR) and speech translation without requiring secondary encoder models. The integrated vision encoder supports variable aspect ratios and resolutions, excelling at visual tasks such as OCR, chart understanding, and object detection.

Rankings & Comparison