Google DeepMind

Frontier Lab

Hassabis/Legg. Frontier Safety Framework.

Founded: 2010
HQ: London, UK
Team: 7,600
Structure: limited company (UK)
Model: Product Revenue

Theory of Change

Google DeepMind's theory of change is that building AGI -- which Hassabis defines as AI capable of genuine scientific hypothesis generation, not just economically useful labor -- is the fastest path to solving humanity's most pressing problems. In Hassabis's words: "I'd be very worried about society today if I didn't know that something as transformative as AI was coming down the line. I firmly believe that. It's almost like the cavalry."

The safety theory of change is that (a) alignment research is commercially motivated because misalignment is currently the bottleneck to useful products, (b) embedding safety researchers inside the lab building AGI gives them maximum influence over deployment decisions, and (c) the Frontier Safety Framework creates thresholds that trigger mitigations before dangerous capabilities emerge. The 145-page "Approach to Technical AGI Safety and Security" paper (April 2025) outlines defense-in-depth: amplified oversight, robust training, interpretability, and system-level security across four risk areas (misuse, misalignment, mistakes, structural risks).

Co-founder Shane Legg, who has been publicly concerned about AI existential risk since 2011, leads the AGI Safety Council. Allan Dafoe, founding director of GovAI, left academia to join GDM because he believes "Demis and Shane are the right leaders" and his impact is maximized by advising influential decision-makers from inside the lab. Rohin Shah frames the core argument: "Alignment work is sort of the bottleneck to actually getting useful products out into the world."

What They Do

Products. Gemini model family (3 Pro released Nov 2025) now matches or exceeds state-of-the-art on most benchmarks. 650+ million monthly active users. ChatGPT still holds 64.5% market share vs Gemini's 21.5%. Google embeds AI across Search (8.5B queries/day), Workspace (1B+ users), Android (2B devices), and Cloud ($70B annual run rate).

Science. AlphaFold predicted 200M+ protein structures, freely available, used by 3M+ researchers in 190 countries. Nobel Prize Chemistry 2024 for Hassabis and Jumper. Isomorphic Labs (drug discovery spinoff, raised $600M) entering human clinical trials with AI-designed drugs. AlphaEvolve solved open mathematical conjectures. Weather Lab outperformed physics-based models for cyclone forecasting.

Safety Research. Frontier Safety Framework published in three versions (June 2024 to Jan 2026) with Critical Capability Levels in autonomy, biosecurity, cybersecurity, ML R&D, and manipulation. Published 145-page AGI safety paper. Gemma Scope (400+ sparse autoencoders for interpretability). Scalable oversight research. Doubly-efficient debate protocol. MONA for reward hacking mitigation. Uniquely among frontier labs, GDM explicitly identifies deceptive alignment as a risk class.

Commitments Record. Signed CAIS extinction risk statement, Seoul Safety Commitments. Released Gemini 2.5 Pro without model card, violating White House, G7, and Seoul commitments (60 UK MPs accused Google of "breach of trust"). February 2025: dropped 2018 AI Principles pledge against weapons and surveillance, co-authored by Hassabis himself. Gemini 3 safety report withheld key safety numbers including manipulation efficacy data. Earlier precedent: in 2015-2017, DeepMind's Streams app accessed 1.6 million NHS patient records without adequate consent, leading to ICO findings of non-compliance. The data was subsequently transferred to Google Health, contradicting privacy assurances -- establishing a pattern of governance failures around sensitive data that predates the current safety structures.

Key People

Demis Hassabis (CEO, co-founder): Neuroscience PhD, chess prodigy, Nobel Laureate. Identifies as "a scientist first." Sets a higher bar for AGI than competitors ("Could an AI have come up with general relativity?"). Co-authored blog post dropping Google's weapons pledge. Calls Sundar Pichai daily.

Shane Legg (co-founder, Chief AGI Scientist): PhD in machine super intelligence. Concerned about AI x-risk since 2011. Predicted 50% AGI by 2028. Leads AGI Safety Council. The most x-risk-concerned co-founder at any major frontier lab.

Notable departures. Geoffrey Hinton left Google May 2023 to warn about AI risks: "I want to talk about AI safety issues without having to worry about how it interacts with Google's business." Won Nobel Physics 2024. Mustafa Suleyman (co-founder) departed 2019 amid bullying complaints, now CEO of Microsoft AI. 142 departures after the Brain/DeepMind merger. All 8 authors of the transformer paper ("Attention Is All You Need") have left. Noncompete enforcement (6-12 months, UK) is controversial. Counterpoint: 20% of 2025 AI hires were returning employees.

Safety team. ~30-50 people in core AGI alignment (Anca Dragan leading, Rohin Shah, Allan Dafoe, Dave Orr). Growing ~37-39% per year. Total GDM headcount: 6,000-7,600. No published safety-to-total ratio.

Money and Incentives

Corporate structure. Wholly-owned subsidiary of Alphabet Inc. (market cap ~$2.4T). No independent financials, no independent legal structure. DeepMind's 2021 bid for independent legal structure was rejected by Alphabet. All decisions ultimately answer to Alphabet's board and shareholders.

Scale of investment. Alphabet planned 2026 capex: $175-185B, mostly for AI compute. Pre-merger DeepMind (2020): GBP 826M revenue, all from internal Google payments. Google Cloud: $70B annual run rate, $240B backlog. Google designs its own TPU chips (Ironwood: 4,614 TFLOPS per chip), giving GDM a structural hardware advantage no competitor can replicate.

Revenue model. AI is embedded across Google's existing products, not a standalone business. This means Google can afford to move cautiously -- but competitive dynamics with OpenAI drive urgency regardless.

Military contracts. Project Nimbus: $1.2B joint contract with Amazon providing cloud/AI to Israeli government and military. Google's own internal report acknowledged "Google Cloud Services could be used for, or linked to, the facilitation of human rights violations." Third-party consultant recommended withholding AI tools. Google signed anyway. Contract may extend up to 23 years. Google fired 50+ employees who protested. At least 5 UK DeepMind staff resigned. Google restricted political discussion on internal forums.

Weapons pledge reversal. In February 2025, Google removed its 2018 AI Principles pledge against weapons and surveillance. The blog post was co-authored by Hassabis. Justified by "global competition." Human Rights Watch: "Google announces willingness to develop AI weapons." Google employees' 2018 petition (thousands signed, dozens resigned) that originally created the pledge has been effectively overridden.

Hedging. Google invested $2B in Anthropic -- a company founded by people who left over safety concerns, now a direct competitor. This hedges Google's bet: if GDM's approach fails, Google still has a financial interest in the safety-focused alternative.

No external funding. Zero Coefficient Giving/Open Phil grants (expected for a Google subsidiary). The separate "DeepMind Foundation" (EIN 822828614) has ~$856K assets and makes $500-$1000 grants in medical research -- irrelevant to GDM operations.

What Others Say

FLI AI Safety Index (Winter 2025): Grade C (2.08/4.0), third behind Anthropic (C+) and OpenAI (C+). Recommendations: strengthen risk-assessment rigor, make governance structures more actionable, reduce lobbying against state-level AI safety regulations.

Zvi Mowshowitz on the Frontier Safety Framework: "This document is weak and unambitious. It is disappointing relative to my expectations." Key weaknesses: the word "commit" never appears; checks only every 6x compute (widest gap of top 3 labs); mitigation plans are empty; misalignment addressed as "future work."

Zvi on Gemini 3: "The model is seriously misaligned in many ways... prone to hallucinations, crafting narratives, glazing." Safety report "repeatedly hides the football." Dan Hendrycks: "On safety -- jailbreaks, bioweapons assistance, overconfidence, deception, agentic harm -- Gemini is worse than GPT, Claude, and Grok."

Semafor: "Hassabis paradox: warning about AI race while leading the race."

Stuart Russell (FLI panel): "AI CEOs claim they know how to build superhuman AI, yet none can show how they'll prevent us from losing control... I'm looking for proof that they can reduce the annual risk of control loss to one in a hundred million. Instead, they admit the risk could be one in ten, one in five, even one in three."

Rohin Shah (GDM insider): "I'm not really sold on [doom narratives]... I expect AI systems will become more powerful relatively continuously... I would be generally in favour of the entire world slowing down on AI progress." On alignment difficulty: "Nobody has very compelling arguments that AI will be dangerous by default, or that it will be safe by default. We just don't know."

Allan Dafoe (GDM insider): Technology is "unstoppable" in the sense that competitive dynamics force adoption, "but the FORM it takes is shapeable." His theory of impact: "advising influential decision makers" like Hassabis and Legg who have "the right character."

What's Absent

No independent safety governance. Unlike Anthropic (LTBT with board veto) or even pre-restructuring OpenAI (nonprofit oversight), GDM has no structural mechanism for safety to override commercial pressure. All governance is internal to Alphabet. When asked what happens if DeepMind disagrees with Alphabet on safety, COO Lila Ibrahim said: "We haven't had that issue yet."

Secret ethics board. The board created as a condition of the 2014 acquisition has never been publicly disclosed. Hassabis says it's "all confidential." Nobody knows who was on it, whether it still exists, or what it did.

No financial transparency. As an Alphabet subsidiary, GDM's budget, safety spending, and resource allocation are all opaque. No way to verify claims about safety investment.

No published safety-to-total headcount ratio. Estimates suggest 30-50 people in core alignment out of 6,000-7,600 total. No public commitment to maintain or grow this ratio.

No safety-driven non-deployment decision. GDM has never publicly declined to release a model for safety reasons. The FSF's "aim" language (not "commit") makes halt commitments unenforceable.

Missing safety data. Gemini 3 safety report withheld manipulation efficacy numbers, propensity rates, and detailed external evaluation results. Pattern of selective disclosure.

No policy for disagreements with Alphabet. No escalation mechanism, no safety veto, no dead man's switch for when commercial and safety interests conflict.

Stated Theory of Change

Google DeepMind's stated theory is a three-part argument:

AGI will be transformative and necessary. Hassabis frames AGI as "the cavalry" -- the technology needed to solve climate change, cure diseases, and enable "radical abundance." Without it, he would be "very worried about society." This makes building AGI a moral imperative, not just a commercial one.
Safety is best achieved from inside the frontier lab. By embedding safety researchers directly in the lab building AGI, they have maximum influence over deployment decisions. Dafoe's theory: "advising influential decision makers" who have "the right character." Shah's argument: alignment is commercially motivated because misalignment is currently the bottleneck to useful products.
A framework of escalating commitments will catch dangerous capabilities before they cause harm. The Frontier Safety Framework defines Critical Capability Levels with thresholds that trigger security and deployment mitigations. The 145-page AGI safety paper outlines defense-in-depth across four risk areas.

The implicit assumption underlying all three parts: the people running GDM (Hassabis, Legg) genuinely care about safety and will use their influence to make the right calls when commercial and safety interests conflict.

Revealed Theory of Change

Actions tell a different story than statements. The revealed theory appears to be:

"Move as fast as corporate constraints allow, publish safety research to maintain credibility, and trust that alignment-as-product-quality will keep pace with capability scaling."

Evidence for the gap between stated and revealed theories:

Weapons pledge dropped (Feb 2025): Hassabis co-authored the blog post removing Google's 2018 commitment against weapons and surveillance AI. Two years earlier, he warned about "not moving fast and breaking things." The geopolitical justification ("world's become a much more dangerous place") applies equally to all labs -- it doesn't explain why GDM specifically reversed course.
Model card delays/omissions: Gemini 2.5 Pro released without the safety documentation GDM committed to at the White House, G7, and Seoul. This was not an oversight -- it was a speed-over-transparency decision.
Safety data withheld: Gemini 3's safety report systematically hides manipulation efficacy, propensity rates, and detailed evaluation results. If the numbers were good, they would be published.
Project Nimbus signed despite internal warnings: Google's own consultants recommended withholding AI from the Israeli military. Google signed a potentially 23-year contract anyway. Protesters were fired. Internal discussion was restricted.
Independence bid rejected and not pursued further: After Alphabet rejected DeepMind's bid for independent legal structure in 2021, Hassabis appears to have accepted the outcome rather than escalating publicly.

The revealed theory of change is more honest than it sounds. Many GDM researchers likely believe that being inside Google -- with access to compute, data, and deployment infrastructure -- gives them more total safety impact than they would have independently, even if they lose individual battles (weapons pledge, model card timing). This is Allan Dafoe's explicit theory. The question is whether the battles being lost are ones that matter.

Key Assumptions

Assumption 1: Hassabis and Legg will have the power to override commercial pressure on safety.

Evidence for: Hassabis is CEO with Nobel Prize credibility. Legg leads AGI Safety Council. Both signed CAIS statement.
Evidence against: Weapons pledge dropped. Independence bid rejected. Pichai has ultimate authority. Hassabis calls Pichai daily -- who is influencing whom?
Testable: Would become clear if GDM ever delayed or halted a release for safety reasons against commercial pressure. This has not happened.
If wrong: Safety decisions are effectively Alphabet board decisions, made by people optimizing for shareholder returns.

Assumption 2: Alignment research at a frontier lab has more impact than external safety work.

Evidence for: Shah's point about needing access to frontier models for safety research. Dafoe's "in the room where it happens" argument.
Evidence against: The Gebru/Mitchell precedent shows Google fires researchers whose findings embarrass products. The "Right to Warn" letter shows employees feel unable to speak freely.
Testable: Does GDM's safety research actually change deployment decisions? So far, no public example exists.
If wrong: Safety researchers at GDM are providing legitimacy to unsafe practices rather than preventing them.

Assumption 3: The FSF will function as intended when CCLs are triggered.

Evidence for: The framework exists and is updated (3 versions). Uniquely addresses deceptive alignment. Applies to all of Google.
Evidence against: Uses "aim" not "commit." 6x compute gap is the widest among top labs. No concrete mitigation plans. Never been tested.
Testable: Will be tested when a model actually triggers a CCL. Has not happened yet.
If wrong: The FSF is a public relations document, not a safety framework.

Assumption 4: Alignment research will scale with capability scaling (the "commercial alignment" thesis).

Evidence for: Currently true -- misalignment causes product failures, creating commercial incentive for alignment.
Evidence against: This only holds while models are weak enough that misalignment causes visible product failures, not invisible catastrophes. As models become more capable, misalignment may become harder to detect but more dangerous.
If wrong: The commercial case for alignment evaporates precisely when alignment matters most.

Strengths

Talent depth. GDM has arguably the deepest bench of AI safety talent at any frontier lab: Shah, Dafoe, Dragan, Orr, Nanda, Gabriel, plus Legg as executive sponsor. Multiple of these people (Shah, Dafoe) are known for intellectual honesty and genuinely seem to be trying to do good work.

Scientific credibility. AlphaFold is the strongest real-world AI safety argument: proof that AI can produce massive public benefit. The Nobel Prize gives Hassabis credibility that other frontier lab CEOs lack. GDM's research culture, while compressed by the merger, still produces fundamental science.

Infrastructure advantage. Vertical integration of custom chips (TPUs), data centers, and models gives GDM structural advantages no competitor can replicate quickly. This means GDM can theoretically afford to invest in safety without existential competitive pressure.

Framework applies to all of Google. Unlike OpenAI's framework (which doesn't apply to Microsoft), GDM's FSF covers the entire Google ecosystem. This is a genuine structural advantage.

Deceptive alignment as explicit risk class. GDM is the only frontier lab that explicitly identifies deceptive alignment in its safety framework. This suggests at least some of the right threat models are informing the framework design.

Weaknesses and Risks

No structural independence from Alphabet. This is the fundamental problem. Every other weakness flows from it. Hassabis cannot make a safety decision that costs Alphabet money without Pichai's approval. The weapons pledge reversal demonstrates that when commercial interests and safety commitments conflict, commercial interests win.

Pattern of broken commitments. The 2018 weapons pledge lasted 7 years before being dropped. Model card commitments were violated within a year of being made. The Seoul Commitments were not honored for Gemini 2.5 Pro. Each broken commitment erodes the credibility of all remaining commitments.

Safety opacity. GDM systematically withholds safety data: manipulation efficacy numbers, propensity rates, detailed evaluation results. The FLI Safety Index specifically recommended more transparent and independent evaluations. The Gemini 3 safety report "hides the football" repeatedly.

Talent hemorrhage. 142 departures post-merger. All transformer paper authors gone. Noncompete enforcement creates resentment. The researchers who produced AlphaFold worked under conditions (research freedom, minimal commercial pressure) that no longer exist.

"Aim" not "commit." The FSF's aspirational language means no commitment is enforceable. Combined with no independent governance, this means safety is a preference, not a constraint.

Suppression of internal dissent. Gebru/Mitchell firings. 50+ employees fired for protesting Nimbus. Political discussion restricted on internal forums. "Right to Warn" letter went unanswered. This environment is hostile to the kind of internal challenge that safety requires.

Cross-References

vs. Anthropic: Anthropic was founded by people who left Google over exactly the concerns that define GDM's weaknesses: structural independence, safety culture, deployment decisions. Anthropic has the LTBT, independent board, RSP with "commit" language, and safety as core identity. GDM has none of these structural features. But GDM has compute advantages Anthropic cannot match and scientific credibility (AlphaFold, Nobel) that Anthropic doesn't have.

vs. OpenAI: Both are part of larger corporate structures (Alphabet and Microsoft respectively). OpenAI's Preparedness Framework checks every 2x compute (vs GDM's 6x). OpenAI has been more publicly messy (board drama, departures), which paradoxically provides more visibility into internal tensions. GDM's smooth exterior may hide more than it reveals.

Allan Dafoe connection to GovAI: Dafoe founded GovAI, then left for GDM. This represents a theory-of-change bet: that insider influence at a frontier lab matters more than independent governance research. If GDM's safety culture degrades, this bet looks worse in hindsight.

Google's $2B Anthropic investment: This hedging strategy means Google benefits if the safety-first approach succeeds. It also means Google is funding a competitor that makes GDM look less safety-conscious by comparison.

What Would Change This Assessment

Positive updates (would increase confidence in GDM's safety approach):

GDM publicly delays or halts a model release for safety reasons, against commercial pressure
FSF language changes from "aim" to "commit" with enforceable mechanisms
Independent safety board with veto power established
Detailed, unredacted safety reports published for all major releases
Hassabis advocates publicly for regulation, even if it disadvantages Google

Negative updates (would decrease confidence):

Another major commitment broken (Seoul, UK AISI partnership, etc.)
Senior safety researcher departs publicly citing safety concerns
GDM deploys a model that triggers a CCL but proceeds anyway
Further suppression of internal safety dissent
Alphabet pressures GDM to reduce safety team or budget

Self-Critique

Weakest claim: The assertion that GDM's safety commitments are primarily performative. It is genuinely possible that the FSF, AGI safety paper, and safety research program represent a serious institutional effort that simply can't be fully public due to competitive dynamics. Rohin Shah and Allan Dafoe are credible, thoughtful people who appear to genuinely believe their work matters. If they are right that insider influence is the highest-impact strategy, then this analysis overstates the importance of structural independence.

Potential bias: This analysis may be overly influenced by the weapons pledge reversal and Project Nimbus, which are vivid, concrete examples that anchor criticism. GDM's day-to-day safety work -- which is less visible -- may be more substantial than these high-profile failures suggest.

What I would most want to check: Internal resource allocation data. If GDM spends 10%+ of its budget on safety research and evaluation, the picture looks very different than if safety is 1%. Without financial transparency, this cannot be assessed.

What a thoughtful defender would say: "GDM has the deepest safety research team at any frontier lab, produces genuinely important safety research (debate, interpretability, dangerous capability evaluations), and the FSF -- even with 'aim' language -- represents more systematic safety thinking than most labs have. The weapons pledge and Project Nimbus are Google decisions, not GDM decisions. Hassabis and Legg are doing the best they can inside a corporate structure they don't control, and their presence at the frontier makes the overall trajectory safer than if a less safety-conscious team were building these systems."

What a thoughtful critic would say: "GDM's safety work provides legitimacy to a corporation that has broken every safety commitment it has made. Hassabis co-authored the blog post dropping the weapons pledge. The FSF uses 'aim' not 'commit.' Safety data is systematically withheld. The most important safety decision GDM ever had to make -- whether to accept corporate capture or insist on independence -- was made in 2021 when they accepted Alphabet's rejection of independent legal structure. Everything since then is operating within constraints that make genuine safety governance impossible."

Connected to (10)

UK AI Security Institutecollaborator

METRevaluates

Microsoft AIstaff to · Mustafa Suleyman

Center for Human-Compatible AIstaff from · Anca Dragan, Rohin Shah

Character.AIstaff to · Noam Shazeer, Daniel De Freitas

Anthropicstaff to · Dario Amodei, Daniela Amodei, others

Centre for the Governance of AIstaff from · Allan Dafoe

Cooperative AI Foundationcollaborator · Allan Dafoe, Thore Graepel

Isomorphic Labsspun off from · Demis Hassabis

OpenAIstaff to · Ilya Sutskever, others

Sources (75)

Every URL that was read during research.

1.Google DeepMind - Wikipediaen.wikipedia.org
2.About Google DeepMinddeepmind.google
3.Responsibility & Safetydeepmind.google
4.Introducing the Frontier Safety Frameworkdeepmind.google
5.Google DeepMind strengthens the Frontier Safety Frameworkdeepmind.google
6.Demis Hassabis — Scaling, superhuman AIs, AlphaZero atop LLMs, AlphaFolddwarkesh.com
7.Shane Legg (DeepMind Founder) — 2028 AGI, superhuman alignment, new architecturesdwarkesh.com
8.Transcript for Demis Hassabis: Future of AI, Simulating Reality, Physics and Video Games | Lex Fridman Podcast #475 - Lex Fridmanlexfridman.com
9.Shane Legg - Wikipediaen.wikipedia.org
10.Demis Hassabis - Wikipediaen.wikipedia.org
11.Demis Hassabis Is Preparing for AI’s Endgametime.com
12.Allan Dafoe on why technology is unstoppable & how to shape AI development anyway | 80,000 Hours80000hours.org
13.Exclusive: 60 U.K. Lawmakers Accuse Google of Breaking AI Safety Pledgetime.com
14.DeepMindailabwatch.org
15.DeepMind UK staff move to unionise amid anger over defence deals and Israel linkscomputing.co.uk
16.Google DeepMind forms a new org focused on AI safety | TechCrunchtechcrunch.com
17.AI Safety Index Winter 2025 - Future of Life Institutefutureoflife.org
18.Mustafa Suleyman - Wikipediaen.wikipedia.org
19.Leading AI Companies Get Lousy Grades on Safetyspectrum.ieee.org
20.The Google Brain-DeepMind merger is probably good for Google. It might not be for us | Fortunefortune.com
21.Announcing Google DeepMinddeepmind.google
22.Taking a responsible path to AGIdeepmind.google
23.Google DeepMind Slammed by Protesters Over Broken AI Safety Promisetechtimes.com
24.Exclusive: Workers at Google DeepMind Push Company to Drop Military Contractstime.com
25.Google Worried It Couldn’t Control How Israel Uses Project Nimbus, Files Revealtheintercept.com
26.Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters | 80,000 Hours80000hours.org
27.What one founder's past says about AI's future : TED Radio Hournpr.org
28.Google violated AI safety commitments British lawmakers say in an open letter | Fortunefortune.com
29.PauseAI Presents: The Google DeepMind Protestpauseai.info
30.Letter to Sir Demis Hassabispauseai.info
31.Gemini 3: Model Card and Safety Framework Reportthezvi.substack.com
32.Advancing Gemini's security safeguardsdeepmind.google
33.Alphabet is confident about plans to double capex spending to a possible $185 billion—but it’s keeping CEO Sundar Pichai up at night | Fortunefortune.com
34.DeepMind's 145-page paper on AGI safety may not convince skeptics | TechCrunchtechcrunch.com
35.Google removes pledge to not use AI for weapons, surveillancecnbc.com
36.Google Announces Willingness to Develop AI for Weaponshrw.org
37.Anca Dragan named Head of AI Safety and Alignment at Google DeepMind - EECS at Berkeleyeecs.berkeley.edu
38.An Approach to Technical AGI Safety and Securityarxiv.org
39.On DeepMind's Frontier Safety Frameworkthezvi.substack.com
40.Frontier Safety Frameworks: Comprehensive Guide & Key Comparisons | Enkrypt AIenkryptai.com
41.AlphaFolddeepmind.google
42.Read Google DeepMind’s new paper on responsible artificial general intelligence (AGI).blog.google
43.Frontier AI Safety Policiesmetr.org
44.Google DeepMind’s Demis Hassabis and the paradox of AI progresssemafor.com
45.Google DeepMind CEO Demis Hassabis on AGI and AI in the Militarytime.com
46.Updating the Frontier Safety Frameworkdeepmind.google
47.Deepening our partnership with Google DeepMind | AISI Workaisi.gov.uk
48.Gemma Scope: helping the safety community shed light on the inner workings of language modelsdeepmind.google
49.Google DeepMind agrees to sweeping research collaboration with the U.K. government | Fortunefortune.com
50.Deepening AI Safety Research with UK AI Security Institute (AISI)deepmind.google
51.Google DeepMind has a new way to look inside an AI’s “mind”technologyreview.com
52.Google DeepMind wants to know if chatbots are just virtue signalingtechnologyreview.com
53.We read the paper that forced Timnit Gebru out of Google. Here’s what it says.technologyreview.com
54.Google's latest Gemini 2.5 Pro AI model is missing a key safety report in apparent violation of promises the company made to the U.S. government and at international summits | Fortunefortune.com
55.Google's latest AI model report lacks key safety details, experts say | TechCrunchtechcrunch.com
56.Google DeepMind After the Merger: Nobel Prizes, Bleeding Talent, and a $185 Billion Bet That Cannot Faildigidai.github.io
57.Why top AI talent is leaving Google’s DeepMindsifted.eu
58.Google axes its promise not to use AI for weapons or surveillance just weeks into the Trump presidency | Fortunefortune.com
59.Google erases promise not to use AI technology for weapons or surveillance | CNN Businesscnn.com
60.Geoffrey Hinton tells us why he’s now scared of the tech he helped buildtechnologyreview.com
61.Scalable AI Safety via Doubly-Efficient Debatedeepmind.google
62.Common Elements of Frontier AI Safety Policiesmetr.org
63.The ethics of advanced AI assistantsdeepmind.google
64.Deepmind Foundation | Cupertino, CA | 990 Report | Instrumentlinstrumentl.com
65.DeepMind now has an AI ethics research unit. We have a few questions for it... | TechCrunchtechcrunch.com
66.New role: AGI Alignment researchmistaketheory.substack.com
67.DeepMind CEO Demis Hassabis Urges Caution on AItime.com
68.Employees Say OpenAI and Google DeepMind Are Hiding Dangers from the Publictime.com
69.AI 2026: Sundar Pichai is Tasked With Preserving Googlebrief.bismarckanalysis.com
70.Google faces new suit over DeepMind NHS patient data scandaltechcrunch.com
71.Gemini 3.1 Pro - Model Carddeepmind.google
72.Researchdeepmind.google
73.Careers at Google DeepMinddeepmind.google
74.Building a culture of pioneering responsiblydeepmind.google
75.MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hackingdeepmind.google