Google logo
Google

Gemini 1.5 Pro (Sep '24)

Released Sep 2024

Gemini 1.5 Pro (Sep '24) is a performance-optimized version of Google's multimodal large language model, specifically the production-ready gemini-1.5-pro-002 iteration. Released on September 24, 2024, this update provides substantial performance improvements over the initial Gemini 1.5 Pro release, particularly in the areas of reasoning, mathematics, and coding. It is built using a Mixture-of-Experts (MoE) architecture, which enhances efficiency by activating only a specialized subset of the model's parameters for any given task.\n\nA defining feature of the model is its massive context window, supporting up to 2 million tokens. This allows it to process and reason across extensive datasets, such as 1,500-page documents, hour-long videos, or large-scale code repositories in a single prompt. The September 2024 update specifically targeted improvements in long-context accuracy and retrieval, making the model more reliable for complex synthesis and deep research tasks.\n\n## Performance and Capabilities\nBenchmarks for the September 2024 release show a 20% improvement in mathematical problem-solving and significant gains in visual understanding and Python code generation. As a natively multimodal model, Gemini 1.5 Pro (Sep '24) processes text, images, audio, and video directly without requiring separate encoders. The update also improved the model's generation speed and reduced latency, while refining its ability to follow complex instructions and generate concise, structured outputs such as JSON.

Rankings & Comparison