Meta logo
Meta
Open Weights

Llama 3.1 Instruct 70B

Released Jul 2024

Llama 3.1 Instruct 70B is a large-scale generative model developed by Meta as part of the Llama 3.1 collection. It is an auto-regressive language model built on an optimized transformer architecture utilizing Grouped-Query Attention (GQA) for improved inference scalability. The model is instruction-tuned specifically for multilingual dialogue and complex reasoning tasks.

One of the most significant updates in the 3.1 version is the expansion of the context window to 128,000 tokens, a sixteenfold increase over the original Llama 3 release. This enables the model to process extensive documents and maintain longer conversational coherence. Additionally, it offers official support for eight languages, including English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

The model was pre-trained on a dataset of approximately 15 trillion tokens, with a knowledge cutoff of December 2023. The instruction-tuned variant underwent a rigorous fine-tuning process involving supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to improve alignment with human preferences for helpfulness and safety. It is released under the Llama 3.1 Community License, allowing for both commercial and research applications.

Rankings & Comparison