Future of Humanity Institute

Research

CLOSED. Bostrom. Spawned the field.

Founded: 2005
HQ: Oxford, UK
Structure: university-affiliated
Model: Grants

Status: CLOSED (April 16, 2024, after 19 years)

Theory of Change

FHI's theory of change was field-building through intellectual incubation. In their own words:

"We attempted to notice overlooked topics deserving of research and attention early, germinating it in the sheltered FHI greenhouse; showing that progress could be made; coalescing a field and setting research directions; attracting bright minds to it; and once it's established enough, setting it free, and moving onto the next seedlings." — Anders Sandberg, Final Report

The causal chain: (1) identify neglected topics of existential importance, (2) demonstrate rigorous research is possible on them, (3) attract talent and funding, (4) build a research community, (5) spin the field off when it can sustain itself, (6) move to the next neglected topic.

This was enacted across multiple domains: human enhancement ethics (2005-2010), existential risk as a systematic category (2002-2020), AI safety and alignment (2011-present), AI governance (2017-2021), biosecurity (2016-2020), digital minds (2020-2024).

Bostrom articulated the research philosophy early: "We pursue an opportunistic research agenda, in that we focus on those questions that are amenable to analysis, and where new insights would bring the greatest payoff in terms of improving humanity's ability to navigate its future" (April 2006).

What They Did

Intellectual output. FHI coined or formalized an extraordinary number of concepts now standard in AI safety and EA: existential risk (as systematic category), information hazards, the unilateralist's curse, existential hope, crucial considerations, the vulnerable world hypothesis, anthropic shadow, the simulation argument, the paperclip maximizer, comprehensive AI services. The terminology itself shaped how the field thinks.

Books. Superintelligence (Bostrom, 2014) became a NYT bestseller, was praised by Musk and Gates, and catalyzed AI safety as a serious field. The Precipice (Ord, 2020) quantified existential risks and directly influenced UK and UN policy. Deep Utopia (Bostrom, 2024) explored the positive scenario.

Organizations seeded. GovAI (Centre for the Governance of AI), AI Impacts, Giving What We Can, the EA movement's intellectual foundations, LessWrong (via Overcoming Bias), CSER (co-founded by FHI's Sean O hEigeartaigh). The Rationalist community traces significant lineage to FHI-associated work.

People trained. Jan Leike (created RLHF, led alignment at OpenAI, now at Anthropic). Allan Dafoe (founded AI governance as a field, now Google DeepMind). Owain Evans (TruthfulQA benchmark). Katja Grace (AI Impacts, expert surveys). Stuart Armstrong (Aligned AI). Andrew Snyder-Beattie (now Coefficient Giving biosecurity). Leopold Aschenbrenner (interned at FHI, later at OpenAI).

Policy influence. Bostrom briefed the UN General Assembly (2015). Multiple staff presented to UK Parliament. Ord contributed to the UN 2020 Human Development Report. Ord was quoted by the UK PM at the UN General Assembly (2021). FHI co-wrote the "Future Proof" UK resilience report.

COVID response. The biosecurity team launched an Epidemic Forecasting Project reaching 10,000 users/day. Brauner et al. (2021) on non-pharmaceutical interventions became one of the ten most-cited papers in Science's history.

Prolificacy. In its first 18 months alone (2005-2007): 40 journal papers, 43 articles and chapters, 5 books — with only 3 salaried researchers plus the director. Peak team size was approximately 40 (2019-2020).

Key People

Nick Bostrom — Founding director (2005-2024). Swedish philosopher, PhD LSE. Described by a colleague as having "the long tail of intelligence" that was visible only in discussion. Worked nights, maintained a mysterious FHI "scorecard" on the whiteboard. Brilliant researcher but poor delegator who "prioritized his own research over FHI's relationship with the university" (per Cotton-Barratt's resignation letter). Post-FHI: founded Macrostrategy Research Initiative. No longer Oxford faculty.

Anders Sandberg — First hire (2006), Senior Research Fellow. Bostrom's intellectual complement — "ebullient and whimsical" to Bostrom's "ultra-serious." Authored the Final Report, the institutional memory of FHI. Now at Mimir Center in Stockholm.

Toby Ord — Senior Research Fellow. Computer science turned philosopher. Founded Giving What We Can, co-founded EA. Published The Precipice, the most policy-impactful FHI output. Was notably skeptical of SBF/FTX involvement. Post-FHI: AI governance at Oxford Martin School.

Notable departures: Owen Cotton-Barratt resigned August 2021, criticizing Bostrom's management in a letter that leaked to the faculty. Tanya Singh (senior ops, sometimes worked 22-hour days) departed June 2022 after the hiring freeze. GovAI's entire team spun out in 2021 "to escape bureaucratic restrictions."

Money and Incentives

Total identifiable funding (2005-2024): Approximately $30-35M over 19 years from all sources.

Revenue breakdown by source:

Open Philanthropy: ~$20.9M (2016-2021) — dominant funder post-2016. Flagship grant: $12.25M (2018) for global catastrophic risks. Several grants investigated by Nick Beckstead, himself a former FHI fellow.
James Martin / Oxford Martin School: ~GBP 2.5M (2005-2016, 60% of early funding)
Amlin (reinsurance): ~GBP 0.9M (2013-2016)
European Research Council: EUR 2M Advanced Grant (2015)
FLI/Musk: ~GBP 1M+ (2015)
Survival and Flourishing Fund: ~$250K
Private donors: ~GBP 760K pre-2016

Business model: 100% grant/donation-dependent. No commercial revenue, no government contracts as a regular stream. All funds flowed through Oxford University. Staff on Oxford pay scales.

The Oxford money trap: By 2020, FHI had approximately GBP 10M in the bank — much of it from the unspent 2018 Open Philanthropy grant — but could not spend it because the Faculty of Philosophy had frozen hiring and fundraising. The amount ultimately returned to Open Philanthropy or disposed of after closure has not been publicly disclosed.

Institutional structure: FHI was a research centre within the Faculty of Philosophy, University of Oxford. No independent legal entity, no independent board, no independent hiring authority. This made FHI entirely dependent on the faculty's goodwill for operational decisions. GPI (Global Priorities Institute, a similar Oxford group that survived) was "less zany and more academically legible."

Funder concentration risk: Open Philanthropy provided roughly 60-70% of total post-2016 funding. Nick Beckstead (grant investigator for several major OP grants) was himself a former FHI research fellow, illustrating the tight interlocks between FHI and the EA funding ecosystem.

Key incentive dynamic: Bostrom and senior researchers wanted to "move fast, hire quickly, work with industry." Oxford wanted standard academic procedures, teaching contributions, and legible philosophical output. As Bowerman put it: "The philosophy faculty's currency is peer-reviewed papers in prestigious journals. That wasn't the currency of FHI. The currency was cool ideas that could improve the world." This mismatch, never resolved, was fatal.

What Others Say

The "physician, heal thyself" critique. Torres: "Longtermists failed to anticipate and mitigate the risks posed by Sam Bankman-Fried... failed to anticipate and mitigate the negative press that buying a huge castle would cause... weren't even able to keep FHI from being shut down. How much evidence do we need that longtermists aren't good at anticipating and mitigating risks?"

The narrowness critique. Torres: "The utopian vision that FHI developed is thoroughly Western, Baconian, and capitalistic... designed almost entirely by extremely privileged white men at elite universities and in Silicon Valley... conspicuous absence of virtually any reference whatsoever to what the future could or should look like from the perspectives of Indigenous communities, Afrofuturism, feminism" and other non-Western traditions.

The anthropomorphism critique. Quillette's review of Superintelligence after 10 years: "Many of Bostrom's arguments rest upon profoundly anthropomorphic premises... Many who work with code find the prospect of programs becoming goal-seeking, power-seeking, and 'making their own decisions' fundamentally implausible." Drew McDermott's classic critique is cited: calling a program function "GOAL" does not give it goals.

The EA/inequality critique. Jacobin: EA and FHI are "at heart a conservative movement, which attempts to present billionaires as a solution to global poverty rather than its cause." Nick Beckstead's claim that "saving a life in a rich country is substantially more important than saving a life in a poor country" is cited as representative.

The methodological critique. Reflective Altruism's epistemics series demonstrates that FHI/EA policy proposals (like strengthening the Biological Weapons Convention) are "substantially more complex than they appear" and that "diamonds in the rough are rare."

The balanced view. Andrew Maynard (Future of Being Human): FHI was "effective in pushing the bounds of thinking in ways that transcended disciplines" but its thinking placed "far greater emphasis on philosophical speculation than practical reality." The world needs boundary-spanning thinking, but grounded in humility and inclusive of diverse perspectives.

Defenders. Moorhouse: "Especially for its size, FHI was staggeringly influential." Even critics rarely dispute FHI's conceptual influence — the debates are about whether that influence was net positive.

What's Absent

No published annual financial reports in 19 years of operation
No diversity reporting or systematic engagement with non-Western perspectives on the future
Oxford Faculty of Philosophy never gave its side of the story beyond a boilerplate statement
No formal impact evaluation of FHI's counterfactual contribution
No succession plan despite 19 years of single-director governance
No published data on how the GBP 10M+ unspent grant money was ultimately disposed of
No systematic alumni tracking — claims about influence rely on anecdote
No explanation for why GPI survived identical Oxford constraints while FHI did not

Stated Theory of Change

FHI's stated mechanism: identify neglected topics of civilizational importance before others do, demonstrate that rigorous academic work is possible on them, attract top talent, build research communities, then release the mature field to sustain itself.

The specific chain to AI risk reduction: (1) pioneer concepts like existential risk and AI alignment as legitimate academic subjects, (2) produce landmark publications (Superintelligence) that shift elite opinion, (3) train researchers who go on to lead safety work at frontier labs, (4) establish AI governance as a field, (5) influence government policy through direct engagement. This is essentially an "ideas-first" theory of change — change how the world thinks, and the world will change how it acts.

Revealed Theory of Change

FHI's actions broadly tracked its stated theory. The incubation model worked multiple times: human enhancement (pioneered then handed off), AI safety (pioneered then adopted by labs), AI governance (pioneered then GovAI spun out), biosecurity (developed then became relevant during COVID). This is one of the clearest cases of stated and revealed theories of change aligning.

However, some divergences are notable:

Bostrom's personal research increasingly diverged from institutional needs. Cotton-Barratt's resignation letter criticized Bostrom for prioritizing his own work over university relations. In the final years, Bostrom was writing Deep Utopia in an Alpine chalet while FHI was dying. The revealed theory of change for the director shifted from "build an institution" to "produce my own intellectual contributions."
The ops burden revealed the actual cost of the model. FHI's theory involved shielding researchers from administrative burden. In practice, this meant operations staff worked unsustainable hours (Tanya Singh's 22-hour days) absorbing the gap between what FHI wanted and what Oxford permitted. The "carapace" model depended on sacrificial ops labor.
FHI valued Oxford's prestige more than institutional independence. Despite years of bureaucratic strangulation, FHI never seriously pursued independence. The revealed preference was for brand over autonomy — which ultimately killed the organization.
The field-building model may have reduced FHI's own necessity. By seeding GovAI, AI Impacts, and dozens of alumni into safety roles, FHI progressively reduced the marginal value of its own continued existence. Bostrom's admission that he should have "pulled the plug" in 2019-2020 suggests he sensed this.

Key Assumptions

Assumption 1: Ideas are the binding constraint on AI safety progress. FHI assumed that the primary bottleneck was conceptual — that the world lacked the intellectual frameworks to think about existential risk, and that providing those frameworks would lead to action.

Evidence for: Pre-FHI, AI safety was fringe. Post-Superintelligence, it's mainstream. Every frontier lab now has a safety team.
Evidence against: Having the concepts hasn't translated into adequate safety investment. OpenAI knows about alignment but ships capabilities faster. Ideas may be necessary but insufficient.
If wrong: FHI's legacy is that of a prescient prophet ignored — intellectually correct but practically irrelevant.

Assumption 2: Academic institutional embedding provides credibility that amplifies impact. FHI chose to remain at Oxford specifically for the prestige and credibility that enabled policy access and media attention.

Evidence for: "University of Oxford" opened doors — Bostrom addressed the UN, Ord was quoted by the PM, the New Yorker profiled FHI.
Evidence against: The academic embedding also killed FHI. GPI survived in the same environment by being "more academically legible" — suggesting FHI's approach was the problem, not Oxford's.
If wrong: Independent organizations (MIRI, Redwood) may achieve more impact per dollar by avoiding institutional constraints.

Assumption 3: Talent is rare and worth extraordinary accommodation. Bostrom's "jewels" metaphor — finding brilliant people and creating bespoke organizational structures around them. The 30% contract renewal rate reflects extreme selectivity.

Evidence for: FHI alumni disproportionately populate leadership roles across AI safety.
Evidence against: The "genius cult" model creates governance gaps, succession failures, and reputational dependency on individuals (as the Bostrom email showed).
If wrong: More distributed, less personality-dependent institutions may be more resilient.

Assumption 4: The topics FHI prioritized were the right ones. FHI's cause prioritization assumed AI risk, biosecurity, and longtermism were the most important things to work on.

Evidence for: COVID validated biosecurity work. ChatGPT validated AI risk work. These aren't fringe anymore.
Evidence against: Torres' critique — FHI's vision of "the future" was narrowly Western, ignoring non-Western perspectives, current-generation justice, and ecological ethics. The "astronomical waste" framing may be a philosophical distortion.
Testability: Partially testable by whether FHI-style x-risk frameworks continue to dominate policy or are supplanted by more pluralistic approaches.

Strengths

Proof of concept for the field-building model. FHI demonstrated that a small group of brilliant researchers at a prestigious institution can seed entire fields. AI safety, AI governance, and systematic existential risk research all trace significant lineage to FHI. This is a genuine and rare accomplishment.
Conceptual infrastructure. The vocabulary FHI created — existential risk, information hazards, crucial considerations, the unilateralist's curse — now structures how an entire community thinks. This is the kind of deep influence that persists long after the originating institution closes.
Personnel pipeline. FHI trained a generation of researchers who now hold leadership positions across AI safety, governance, and policy. Jan Leike, Allan Dafoe, Owain Evans, Katja Grace — these people's careers were shaped by FHI.
Demonstrated that unconventional research can have real policy impact. From Bostrom at the UN to Ord in Downing Street, FHI showed that "weird" philosophical questions about the far future can influence actual governance.
Intellectual courage. FHI was willing to study questions that were "obviously" fringe — and many of them turned out to be among the most important questions of the century. This willingness to be wrong about social acceptability in order to be right about importance is genuinely valuable.

Weaknesses and Risks

Catastrophic governance failure. Single director for 19 years, no board, no succession plan, no independent legal entity. Bostrom's management weaknesses (poor delegation, prioritizing personal research, neglecting university relationships) were never checked by any governance mechanism. This is the canonical failure mode of founder-led organizations.
Institutional fragility. An organization that received $20M+ from a single funder had no independent legal existence and could be shut down by a philosophy department's administrative freeze. This level of institutional vulnerability is a system design failure, not just bad luck.
Narrow intellectual base. For an org studying "the future of humanity," the perspective base was remarkably homogeneous — predominantly white, male, Western, analytically-trained philosophers. Torres' critique about absent perspectives has substance even if the TESCREAL framing is overblown.
The "physician, heal thyself" problem. FHI studied catastrophic risk but couldn't manage its own institutional risk. It failed to anticipate or mitigate the Bostrom email fallout, the SBF adjacency damage, or the Oxford bureaucratic escalation. If FHI's researchers were as good at risk assessment as claimed, they should have seen these coming.
Anthropomorphic AI risk framing may be superseded. Superintelligence's core scenario — a unified agent with goals pursuing a "decisive strategic advantage" — may be less relevant to the actual AI risk landscape, where risks emerge from diffuse systems, misuse, and organizational failures rather than rogue superintelligences.
Bostrom's personal controversies became institutional liabilities. The 1996 email and his 2023 non-apology damaged FHI's reputation in ways that a better-governed organization could have contained. The "bloodthirsty mosquitos" post-scandal comment suggests Bostrom remained oblivious to the reputational dynamics.

Cross-References

MIRI: Pursued a more technical, less academic approach to AI safety. MIRI maintained independence (no university affiliation) which gave it more operational freedom but less institutional prestige. FHI and MIRI represent complementary bets on how to reduce AI risk.
GovAI: FHI's most successful spinoff. Escaped Oxford's bureaucracy to become an independent nonprofit, now a leading AI governance institution. Ben Garfinkel (Acting Director) was FHI's AI Governance Team lead.
CSER (Cambridge): Co-founded by FHI's Sean O hEigeartaigh. Sibling institution that took a more policy-focused, less philosophically ambitious approach. Survived where FHI did not.
Open Philanthropy / Coefficient Giving: FHI's dominant funder. Multiple interlocks (Beckstead investigated grants, Snyder-Beattie now at CG). The relationship illustrates both the benefits (aligned funding) and risks (funder concentration, small-world dynamics) of EA ecosystem funding.
GPI (Global Priorities Institute): Survived at Oxford by being "more academically legible." The comparison reveals that FHI's closure was not inevitable — it resulted from specific institutional choices.

What Would Change This Assessment

Evidence that FHI's key contributions would not have happened otherwise. Currently, the counterfactual impact is asserted but not demonstrated. If someone could show that AI safety would not have become mainstream without Superintelligence specifically, FHI's legacy strengthens enormously.
Oxford Faculty releasing their perspective on FHI. If the faculty had legitimate concerns about research quality, management, or conduct that FHI's narrative obscures, the assessment of FHI's closure would shift.
Anthropomorphic AI risk scenarios materializing. If an AI system pursues instrumental convergence and power-seeking in ways that resemble Bostrom's models, the Superintelligence framework would be vindicated against its critics.
FHI alumni having outsized impact relative to comparison groups. Systematic tracking of FHI alumni vs. similar-career-stage researchers would test whether FHI's personnel pipeline was genuinely exceptional or merely well-networked.
Discovery that unspent funds were lost or misallocated. If the GBP 10M+ stuck in Oxford was returned to funders vs. absorbed by the university, this would change the financial assessment.

Self-Critique

What's weakest in this analysis:

I may be overly sympathetic to FHI's self-narrative because the Asterisk article and Final Report are the most detailed sources, and both are insider perspectives. The university's silence creates an inherent bias.
The counterfactual analysis is thin. Would AI safety have emerged as a field without Superintelligence? Probably yes — but possibly later and differently. I can't quantify this.
I haven't deeply engaged with the Toby Ord podcast transcript (46K words) — the most candid long-form source on FHI thinking. A deeper reading might shift my assessment of the intellectual substance.

What would a thoughtful person who disagrees say:

"FHI was a vanity project for Bostrom, funded by EA money, that produced speculative philosophy dressed up as risk analysis. The real work on AI safety happens at labs, not philosophy departments."
"FHI's conceptual contributions are overstated — existential risk thinking predates Bostrom (Sagan, Schell), and AI safety research would have happened anyway as capabilities advanced."
"The entire longtermist framework FHI built is a philosophical distortion that diverts resources from present suffering to speculative futures."

What information would most change my view: Oxford's internal assessment of FHI. If the university had substantive concerns about research quality, ethical conduct, or institutional integrity that go beyond bureaucratic friction, FHI's narrative of "death by bureaucracy" would be seriously undermined.

Connected to (13)

Anthropicstaff to · Jan Leike Macrostrategy Research Initiativespun off from · Nick Bostrom Coefficient Givingstaff to · Andrew Snyder-Beattie Aligned AIstaff to · Stuart Armstrong Centre for the Governance of AIspun off from · Allan Dafoe Google DeepMindstaff to · Allan Dafoe AI Impactsspun off from · Katja Grace Centre for the Study of Existential Riskcollaborator · Sean O hEigeartaigh Giving What We Canspun off from · Toby Ord Open Philanthropystaff to · Nick Beckstead

Mimir Center for Long Term Futures Researchstaff to · Anders Sandberg

Effective Ventures Foundation UKcollaborator

Global Priorities Institutecollaborator

Sources (54)

Every URL that was read during research.

1.Future of Humanity Institute - Wikipediaen.wikipedia.org
2.Looking Back at the Future of Humanity Instituteasteriskmag.com
3.The End of the Future of Humanity Institute (updated) - Daily Nousdailynous.com
4.Future of Humanity Institutefutureofhumanityinstitute.org
5.Institute That Pioneered AI ‘Existential Risk’ Research Shuts Down404media.co
6.Oxford shuts down Elon Musk-funded Future of Humanity Institute – The Oxford Studentoxfordstudent.com
7.Goodbye to FHIfinmoorhouse.com
8.Unknownnickbostrom.com
9.Why a Philosopher’s Racist Email from 26 Years Ago is News Today - Daily Nousdailynous.com
10.Philosophy don apologises for racist email: “Blacks are more stupid than whites” - Cherwellcherwell.org
11.‘Superintelligence,’ Ten Years Onquillette.com
12.Does the world need another Future of Humanity Institute?futureofbeinghuman.com
13.Nick Bostrom - Wikipediaen.wikipedia.org
14.Superintelligence: Paths, Dangers, Strategies - Wikipediaen.wikipedia.org
15.The Future of Humanity Institute closesbioethicsobservatory.org
16.The Precipice: Existential Risk and the Future of Humanity - Wikipediaen.wikipedia.org
17.End of the Future of Humanitygeneticsandsociety.org
18.The Future of the Future of Humanity Instituterealtimetechpocalypse.com
19.Future of Humanity Institute shuts downnews.ycombinator.com
20.Elite Universities Gave Us Effective Altruism, the Dumbest Idea of the Centuryjacobin.com
21.ClearerThinking.org Podcast | The path to utopia (with Nick Bostrom)podcast.clearerthinking.org
22.How Britain's oldest universities are trying to protect humanity from risky A.I.cnbc.com
23.#83 - Nick Bostrom: Simulation and Superintelligence | MIT | Artificial Intelligence Podcastlexfridman.com
24.Anders Sandberg - Wikipediaen.wikipedia.org
25.Papers — Future of Humanity Institutefutureofhumanityinstitute.org
26.How to make a difference in research: An interview with Nick Bostrom | 80,000 Hours80000hours.org
27.Nick Bostromnickbostrom.com
28.Oxford Future of Humanity Institutenickbostrom.com
29.About Us | GovAIgovernance.ai
30.Andrew Snyder-Beattieopenphilanthropy.org
31.Unknownnickbostrom.com
32.The Precipice Revisited — Toby Ordtobyord.com
33.Toby Ord - Wikipediaen.wikipedia.org
34.Projects — Toby Ordtobyord.com
35.Toby Ord on The Precipice and humanity's potential futures80000hours.org
36.Treacherous Utopia - Farsightfarsight.cifs.dk
37.TESCREAL - Wikipediaen.wikipedia.org
38.Future of Humanity Instituteweb.archive.org
39.Research - Future of Humanity Instituteweb.archive.org
40.Team - Future of Humanity Instituteweb.archive.org
41.Institute accused of ‘eugenics on steroids’ shut down by Oxford University - Cherwellcherwell.org
42.Deep Utopianickbostrom.com
43.Consent Isn't Always Enoughjefftk.com
44.Anders Sandberg — The Mimir Centermimircenter.org
45.£13.3m boost for Oxford’s Future of Humanity Institute | Universityox.ac.uk
46.Future of Humanityoxfordmartin.ox.ac.uk
47.GovAI has Relaunched | GovAI Bloggovernance.ai
48.Future of Humanity Institute — Grokipediagrokipedia.com
49.Future of Humanity Institute - Wikipediaen.wikipedia.org
50.Oxford Shuts Down Elon Musk-Funded Institute Led by Controversial Philosopher Nick Bostrom: 'Death by Bureaucracy' - London Dailylondondaily.com
51.Prominent AI Philosopher and ‘Father’ of Longtermism Sent Very Racist Email to a 90s Philosophy Listservvice.com
52.🤖🌈 My chat (+transcript) with Nick Bostrom on life in an AI utopiafasterplease.substack.com
53.Investigation Launched into Oxford Don’s Racist Email - The Oxford Bluetheoxfordblue.co.uk
54.Oxford Uni offers students support but no update on any action against Profesor Bostromthetab.com