Google logo
Google
Open Weights

Gemma 3 1B Instruct

Released Mar 2025

Gemma 3 1B Instruct is a lightweight, text-based open model developed by Google. Built on the research and technology behind the Gemini models, it is the smallest instruction-tuned variant in the Gemma 3 family. It is specifically designed for high-efficiency and low-latency performance on resource-constrained hardware, such as mobile devices and laptops.

Unlike the larger models in the Gemma 3 series, which feature native multimodal capabilities for image and text processing, the 1B variant is specialized for text-only processing. It supports a context window of 32,000 tokens and is optimized for tasks including summarization, question answering, and logical reasoning.

The model architecture utilizes a decoder-only transformer design with a novel interleaved attention mechanism. This strategy alternates between local sliding-window attention and global self-attention layers to manage memory efficiency and mitigate the KV cache bottleneck during long-context inference. It also incorporates QK-normalization and uses the same 256,000-token vocabulary as the Gemini 2.0 series.

Gemma 3 1B Instruct is released with open weights, allowing for commercial and research applications. It is tuned through a combination of knowledge distillation from larger frontier models and reinforcement learning to enhance its instruction-following capabilities and safety profile.

Rankings & Comparison