DeepSeek-V3. 2: Pushing the Frontier of Open Large Language Models We introduce DeepSeek-V3 2, a model that harmonizes high computational efficiency with superior reasoning and agent performance The key technical breakthroughs of DeepSeek-V3 2 are as follows: (1) DeepSeek Sparse Attention (DSA): We introduce DSA, an efficient attention mechanism that substantially reduces computational complexity while preserving model performance in long-context scenarios
DeepSeek · GitHub Python 23,076 MIT 2,139 247 (3 issues need help) 38 Updated on Jan 26 DualPipe Public A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3 R1 training
DeepSeek | 深度求索 Founded in 2023, DeepSeek focuses on researching world-leading general artificial intelligence (AI) underlying models and technologies, tackling cutting-edge AI challenges
DeepSeek - Wikipedia Based in Hangzhou, Zhejiang, DeepSeek is owned and funded by High-Flyer, a Chinese hedge fund DeepSeek was founded in July 2023 by Liang Wenfeng, the co-founder of High-Flyer, who also serves as the CEO for both of the companies [7][8][9] The company launched an eponymous chatbot alongside its DeepSeek-R1 model in January 2025