DeepSeek-V3

Open · DeepSeek · Language model

Large MoE model offering frontier-class quality at low inference cost, openly available.

Specifications

What it was trained for

General-purpose language understanding, generation, coding, and reasoning as an efficient mixture-of-experts model.

Mixture-of-experts (subset of parameters per token)Strong multilingual abilitySolid coding and math performanceOpen weights (MIT)Long context support

A strong open MoE general model competitive with leading proprietary chat models while keeping inference cost low through sparse expert activation.