Artificial intelligence · InfoQ · 5 days ago

DeepSeek Releases v3.1 Model with Hybrid Reasoning Architecture

DeepSeek V3.1 pairs a hybrid thinking/non-thinking architecture with a 128K-token context window, FP8 precision, and 671B parameters, delivering strong, cost-efficient coding and reasoning performance.