Theory of Change
DeepSeek's stated goal is artificial general intelligence (AGI), pursued through open research rather than commercialization. Founder Liang Wenfeng: "Our goal is AGI -- artificial general intelligence. Large language models are likely a necessary step toward AGI and already exhibit some of its characteristics, so we'll start there." He frames this as curiosity-driven research inseparable from a national project: "We believe that as the economy develops, China should gradually become a contributor rather than always free-riding. Over the past three decades of IT waves, we've hardly participated in real technological innovation."
The theory of change, such as it is: build frontier AI capabilities through algorithmic innovation, release everything as open-weight under MIT License, and thereby advance both AGI research and China's position as a technology innovator. Liang: "Open-sourcing and publishing papers don't mean we lose anything. For technical professionals, being followed is a deeply fulfilling experience. In fact, open source is more of a cultural act than a commercial one."
DeepSeek does not articulate any theory of change related to AI safety. The concept does not appear in any of Liang's interviews, DeepSeek's publications, or the company's public communications.
What They Do
DeepSeek has released 8+ major model families in ~2 years, each introducing genuine architectural innovations:
- DeepSeek-V2 (May 2024): Multi-Head Latent Attention (MLA, reducing KV cache 93%) and DeepSeekMoE with shared experts. Triggered a Chinese AI price war.
- DeepSeek-V3 (Dec 2024): 671B parameters, FP8 training, multi-token prediction. Claimed $5.6M training cost (GPU hours only; total infrastructure estimated at $1.6B by SemiAnalysis).
- DeepSeek-R1 (Jan 2025): Reasoning model comparable to OpenAI o1. R1-Zero demonstrated emergent reasoning from pure RL. Became #1 free app in US App Store; caused $600B Nvidia crash and $1T+ tech selloff.
- DeepSeek V3.1 (Aug 2025): Hybrid thinking/non-thinking modes. 40%+ improvement on SWE-bench over predecessors.
- DeepSeek V3.2 (Dec 2025): Comparable to GPT-5. Sparse attention innovation.
- V4 (expected Mar 2026): Reportedly trained on Nvidia Blackwell chips. Being optimized for Huawei's Ascend chips first, with US chipmakers excluded from early access.
All models released under MIT License. The technical innovations are genuine -- PTX-level GPU optimization (below CUDA, extremely rare), custom distributed training infrastructure, novel attention mechanisms. Stanford's Christopher Manning: "It's a sad state of affairs for what has long been an open country advancing open science and engineering that the best way to learn about the details of modern LLM design and engineering is currently to read the thorough technical reports of Chinese companies."
The company has 96M+ monthly active users. Downloads on Hugging Face increased nearly 1,000% since January 2025.
Key People
Liang Wenfeng (born 1985) -- Founder and CEO of both DeepSeek and High-Flyer hedge fund. BEng/MEng Zhejiang University. Quant trader since the 2008 financial crisis. Declined a co-founder offer from DJI's Frank Wang. Wrote the Chinese preface for the book on Jim Simons. Named to Time 100 AI. Net worth ~$4.6B (Hurun). Holds 84% of DeepSeek through two shell corporations. Met Xi Jinping at a tech summit in February 2025. Liang is DeepSeek -- there is no visible second-in-command, no independent board, no named CTO.
Team: ~150 employees, flat organization, no KPIs, no assigned roles. Hiring philosophy is "passion and curiosity over experience" -- mostly fresh graduates from Chinese universities. Salaries reportedly exceed $1.3M for promising candidates. Hoover Institution analysis: 98% of researchers have Chinese institutional ties, only 24% had US connections (mostly brief). The V2 model was built "entirely by local talent -- no one returned from overseas."
Notable absences: No safety lead, no safety team, no safety researchers with public profiles. Zero employee perspectives found in the evidence base -- no interviews, no departures, no public statements from anyone other than Liang.
Money and Incentives
Funding: Entirely from High-Flyer hedge fund. No VC, no grants, no external investors. High-Flyer AUM peaked at ~$14B (2021), currently ~$10B. Liang: "Money has never been the problem for us; bans on shipments of advanced chips are the problem." This is an extraordinary funding model -- DeepSeek operates without accountability to investors, regulators, or the public. Early infrastructure investment: $180M spent on data centers and 10,000+ A100 GPUs by mid-2022, before US export controls. Fire-Flyer 1 (2019, 1,100 GPUs, 200M yuan) and Fire-Flyer 2 (2021, 1B yuan budget) established custom training capabilities that preceded DeepSeek's formal founding.
True costs: The $5.6M training cost claim covers only the final successful V3 pretraining run. SemiAnalysis estimates:
- Total server capex: ~$1.6B
- Operating costs: ~$944M
- GPU inventory: ~50,000 Hopper GPUs including ~10K H800, ~10K H100 (illegal under US export controls), ~30K H20, ~10K A100
- Infrastructure built pre-ban: by mid-2022, $180M spent on data centers with 10,000+ A100 GPUs
Revenue model: None. Liang: "If you're looking for a purely commercial rationale, you might not find one because it's not cost-effective." API pricing reportedly at or below cost. The company may be considering outside investment for the first time (mid-2025 reports).
Incentive structure: DeepSeek's incentives are uniquely aligned with Liang's personal vision and uniquely misaligned with safety. There is no investor demanding responsible deployment. There is no regulatory body with effective oversight. There is no board to push back on risky releases. There is no revenue model that could be threatened by safety incidents. There is no competitor in the Chinese market with higher safety standards that could create competitive pressure for safety. The only external pressure on DeepSeek's content comes from CCP censorship requirements, which have nothing to do with AI safety and actively degrade model safety (CrowdStrike: models generate 50% more insecure code when prompts mention Tibet or Uyghurs).
Export control exposure: Reuters reported DeepSeek has H100s procured after the US ban. V4 reportedly trained on Nvidia Blackwell (most restricted class). If confirmed, DeepSeek directly benefited from export control evasion.
What Others Say
Safety assessments (unanimous -- all catastrophic):
- Cisco/UPenn: 100% jailbreak success rate on HarmBench. "Not a single harmful prompt blocked."
- NIST CAISI: 94% jailbreak success rate. 12x more susceptible to agent hijacking than US models. Echoes 4x as many CCP narratives.
- Enkrypt AI: 11x more likely to generate harmful content than o1. 83% bias attack success. 78% insecure code generation success.
- AI Lab Watch: 1% overall score (lowest of all labs). FLI AI Safety Index: D grade.
- Qualys: 58% jailbreak failure rate in their testing; 18 distinct attack types.
- Holistic AI: once jailbroken, R1 continues answering all subsequent questions without restriction -- no safety reset.
Military adoption: Jamestown Foundation analysis of PLA procurement documents shows DeepSeek adoption for "intelligentized warfare" across the C4ISR chain -- drone swarms, electronic warfare, surveillance integration. NUDT benchmarks claim DeepSeek reduces energy consumption ~40% vs GPT-4 while maintaining accuracy. Shenyang Aircraft Design Institute (J-15, J-35 fighter jets) confirms using DeepSeek. Public security systems embedding DeepSeek for real-time facial recognition, crowd behavior analysis, and "proactive risk identification."
Distillation/IP theft: OpenAI (Jan 2025): "We are aware of and reviewing indications that DeepSeek may have inappropriately distilled our models." Early DeepSeek models answered "I'm ChatGPT" when asked their identity. Anthropic (Feb 2026): accused DeepSeek of using 24K fake accounts to harvest Claude training data. CSIS frames this strategically: the real question is not fairness but whether US firms have any mechanism to prevent this. The answer appears to be no.
Privacy/security: Feroot Security discovered hidden code in DeepSeek's web login page connecting to China Mobile (state-owned telecom banned in US, Pentagon-linked). Independently verified by University of Calgary and UC Berkeley. Wiz Research found an exposed ClickHouse database with 1M+ chat records, API keys, plaintext data -- "completely open and unauthenticated."
Foreign policy bias: CSIS benchmark testing found DeepSeek-V3 recommends "more hawkish, escalatory" responses in crisis scenarios, "particularly acute in scenarios involving free, Western countries." This bias is statistically significant vs GPT-4o and Claude.
Strongest defense: Eric Hartford ("The Demonization of DeepSeek"): argues the model weights are "just safetensor data" that "don't phone home, don't spy, don't exfiltrate data." The real safety issues are in deployment infrastructure, not the weights. The NIST report is "a political hit piece disguised as science." Several Stanford HAI faculty also praise the open-source contribution to research.
Most balanced: CSIS (Gregory Allen) argues export controls didn't fail -- they were poorly implemented initially. DeepSeek's efficiency gains are part of a preexisting trend (4x algorithmic improvement per year), not evidence that compute doesn't matter. The biggest impacts of export controls are forward-looking. Liang himself acknowledges the chip embargo is his biggest challenge.
What's Absent
- No safety infrastructure at all. No RSP, no FMSP, no safety team, no safety commitments, no model cards, no red-team reports, no safety publications, no engagement with the AI safety community. Among frontier labs, this is unique.
- No governance. One-man control (Liang, 84% through shells), no independent board, no ethics review, no whistleblower policy, no external advisory body. The governance structure would be inadequate for a startup; for a frontier lab with military applications and 96M users, it is alarming.
- No deployment restraint. Zero examples of any model or capability being withheld or delayed for safety reasons. Every model appears to have been released without safety-motivated constraints.
- No response to criticisms. DeepSeek has never publicly addressed the 100% jailbreak rate, the database exposure, the China Mobile connection, the military procurement, or any other safety finding.
- No financial transparency. All financial data comes from estimates by outside analysts. No public financial statements for either DeepSeek or High-Flyer.
- No employee voices. Zero direct employee perspectives in the entire evidence base. No departures, no public statements from anyone but Liang.
Recommended Reading
Liang Wenfeng interview (36Kr/Waves, Jul 2024, translated Jan 2025) -- The most candid, unfiltered look at how DeepSeek thinks. Liang speaks freely about AGI, China's innovation gap, why open-source is "cultural not commercial," and why he prefers hiring curious newcomers over experienced researchers. https://jiexu.substack.com/p/interview-deepseek-founder-liang-wenfeng-a-journey-by-curiosity
CSIS: "DeepSeek, Huawei, Export Controls, and the Future of the U.S.-China AI Race" (Jul 2025) -- The most rigorous analytical treatment. Debunks the $5.6M myth, explains High-Flyer's hedge fund heritage, covers chip smuggling and Huawei alternatives, makes the strongest case for why export controls still matter. Contains previously private information about SMIC yields and Huawei chip production. https://www.csis.org/analysis/deepseek-huawei-export-controls-and-future-us-china-ai-race
Cisco Security Assessment (Jan 2025) -- Two pages that say everything about DeepSeek's safety posture: 100% jailbreak success rate, zero harmful prompts blocked. https://blogs.cisco.com/security/evaluating-security-risk-in-deepseek-and-other-frontier-reasoning-models
Eric Hartford: "The Demonization of DeepSeek" -- The strongest counterargument. Argues the weights are inert data, the real issues are deployment infrastructure, and the NIST report is politically motivated. Even if you disagree, this steelmans the pro-DeepSeek case. https://erichartford.com/the-demonization-of-deepseek
Jamestown Foundation: PLA adoption of DeepSeek (Oct 2025) -- Detailed analysis of military procurement documents showing how open-weight models flow into authoritarian military and surveillance applications with no mechanism for prevention. https://jamestown.org/deepseek-use-in-prc-military-and-public-security-systems/