DeepSeek's V3 AI model gets a major upgrade - here's what's new
Briefly

DeepSeek has launched its latest model, V3-0324, with notable gains in reasoning performance and front-end development skills. The model, which is open-source and licensed under MIT, is designed primarily for non-complex reasoning tasks. The company reported a significant jump in scores on industry-standard benchmarks, especially the AIME math test, where V3-0324 beat the previous version by nearly 20 points. Despite these improvements, DeepSeek maintains that its R1 model remains superior for complex reasoning challenges and continues to be a leading competitor in the AI space.
DeepSeek's newly released V3-0324 model enhances reasoning performance and coding skills, while still being best suited for non-complex reasoning tasks.
The company highlighted that V3-0324 scored nearly 20 points higher on the AIME benchmark, demonstrating improved performance over its predecessor.
Despite the advancements, DeepSeek cautions that R1 remains its top model for more complex reasoning tasks and ranks fourth on the Chatbot Arena.
DeepSeek continues to navigate the challenges of benchmark saturation by using more challenging assessments like the AIME to ensure model effectiveness.
Read at ZDNET