Mixtral 8x7B
Mistral's efficient mixture-of-experts model
Mistral AI • December 2023
Technical Specifications
Training Data: Up to early 2023
Parameters: 8x7B (MoE)
Training Method: Mixture of Experts
Context Window: 32,000 tokens
Knowledge Cutoff: September 2023
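The "8x7B" label does not mean 8 x 7B = 56B parameters: the eight experts share the attention, embedding, and router weights, so the model stores roughly 47B parameters but uses only about 13B per token (two experts per layer). A back-of-the-envelope sketch, using dimensions from the publicly released Mixtral 8x7B configuration; treat the numbers as assumptions if you are checking a different checkpoint:

```python
# Back-of-the-envelope parameter count for the "8x7B" sparse MoE layout.
# Dimensions follow the publicly released Mixtral 8x7B config; router and
# norm weights are ignored (they are negligible).
d_model, d_ff = 4096, 14336
n_layers = 32
n_experts, top_k = 8, 2
vocab = 32000
n_kv_heads, head_dim = 8, 128

expert_mlp = 3 * d_model * d_ff                                   # SwiGLU expert: w1, w2, w3
attention = d_model * (2 * d_model + 2 * n_kv_heads * head_dim)   # q, o + k, v (grouped-query)
embeddings = 2 * vocab * d_model                                  # input embedding + LM head

total = n_layers * (n_experts * expert_mlp + attention) + embeddings
active = n_layers * (top_k * expert_mlp + attention) + embeddings

print(f"stored parameters ~ {total / 1e9:.1f}B")   # ~46.7B
print(f"active per token  ~ {active / 1e9:.1f}B")  # ~12.9B (top-2 of 8 experts)
```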
Key Features
Mixture of Experts • High Efficiency • Open Weights
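Because the weights are openly released, the model can be pulled and run locally. A minimal inference sketch, assuming the mistralai/Mixtral-8x7B-Instruct-v0.1 checkpoint on the Hugging Face Hub and the transformers and accelerate libraries; the prompt format and memory note are approximations to verify against the model card:

```python
# Minimal local-inference sketch for the open Mixtral weights.
# The full fp16 model needs on the order of 90 GB of GPU memory;
# quantized variants fit on much less.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mixtral-8x7B-Instruct-v0.1"  # assumed Hub id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision to roughly halve memory
    device_map="auto",          # shard across available GPUs (needs accelerate)
)

prompt = "[INST] Explain mixture-of-experts routing in two sentences. [/INST]"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```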
Capabilities
Reasoning: Good
Coding: Very Good
Efficiency: Excellent
What's New in This Version
Sparse mixture-of-experts activation: each token is processed by only two of the eight experts per layer, delivering strong quality at a fraction of the inference compute of a comparably sized dense model.
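The sparse activation works by having a learned router send each token through a small subset of expert networks. The sketch below illustrates top-2 routing over eight experts under simplified assumptions: the expert here is a plain MLP rather than Mixtral's gated SwiGLU block, and the per-token loop stands in for the batched per-expert kernels used in production; names and dimensions are illustrative, not Mistral's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoELayer(nn.Module):
    """Illustrative sparse MoE feed-forward block: a router picks the top-2
    of 8 expert MLPs for every token, so only 2/8 of the expert weights are
    used per token even though all 8 are stored."""

    def __init__(self, d_model=512, d_ff=2048, n_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                        # x: (tokens, d_model)
        logits = self.router(x)                  # (tokens, n_experts)
        weights, idx = logits.topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)     # renormalize over the chosen experts
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, k] == e            # tokens routed to expert e in slot k
                if mask.any():
                    out[mask] += weights[mask, k].unsqueeze(-1) * expert(x[mask])
        return out

# Toy usage: 4 tokens of width 512.
layer = SparseMoELayer()
tokens = torch.randn(4, 512)
print(layer(tokens).shape)   # torch.Size([4, 512])
```

Only the selected experts do any work per token, which is why stored capacity can grow (more experts) without a proportional increase in per-token compute.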
Other Mistral AI Models
Explore more models from Mistral AI
Mistral Small 4
Mistral's unified open-weight MoE model combining reasoning, multimodal, and coding capabilities under Apache 2.0, with only 6.5B active parameters
Mistral Large 3
Mistral's state-of-the-art open-weight frontier model with multimodal and multilingual capabilities under Apache 2.0
Ministral 3 14B
Mistral's high-performance dense model in the new Ministral 3 family