← AI Safety Orgs

Conjecture

Empirical Research

Cognitive emulation. Controversial.

Founded
2022
HQ
London, UK
Team
13
Structure
limited company (UK)
Model
Vc Investment

Theory of Change

Conjecture's theory of change has shifted substantially since founding. The current version:

Current (2024-present): AI development is "algorithmic carcinogenesis" -- capabilities growing without understanding, producing uncontrollable complexity. The solution is to treat AI as a software engineering problem. Conjecture builds "Cognitive Software" -- modular, auditable systems where LLMs are used as components within human-understandable architectures, not as autonomous black boxes. Their Wright Brothers analogy: "We're building 'airplanes' not 'engines.'" The airplane (safe architecture around the LLM engine) is what makes flight possible without killing the pilot.

Their CoEm (Cognitive Emulation) agenda proposes: "build predictably boundable systems, not directly aligned AGIs." Break cognitive tasks into primitive building blocks, compose them modularly, minimize black-box computation, keep systems in the "human regime" where existing institutions can govern them.

Original (March 2022): "If you need to roll high, roll many dice" -- solve alignment by scaling decorrelated research bets. Leahy: "I don't expect any of the current methods to work... we have to go through at least one crazy new breakthrough." This vision was abandoned after 8 months when they admitted none of the approaches had worked.

Parallel track (2023-present): Via the spinoff ControlAI, a Direct Institutional Plan to inform lawmakers about extinction risks and push for binding international regulation on advanced AI development.

What They Do

Products: Axiom (low-latency voice chat API), Cubed/Hypercubed (coding assistant), tactics.dev (AI workflow deployment platform), CTAC (custom cognitive programming language). All being consolidated into a "cognitive emulation platform." AWS infrastructure deployed via Daemon consulting partnership. No public information on customers or revenue.

Policy (via ControlAI): Direct Institutional Plan -- cold-emailing lawmakers, briefing them on extinction risks, getting public commitments to binding regulation. Claims 112 UK lawmakers supporting their campaign. Led two debates in UK House of Lords on AI extinction risk (Jan 2026). "A Narrow Path" and MAGIC proposals for international AI governance institutions.

Research output (declining): Polytope Lens paper (2022, self-assessed as negligible utility), Simulators post by janus (2022, popular but not useful per experienced researchers), sparse autoencoders interim report (Dec 2022 -- preceded and influenced Anthropic's SAE work, arguably their most impactful research contribution), Cognitive Emulation proposal (Feb 2023), Cognitive Software Roadmap (Dec 2024). The researchers who produced the most cited work have since departed.

Field-building: Hosted SERI MATS London cohort, ARENA program, AI Security Bootcamp. All funded by Open Phil grants ($1.15M total), not Conjecture's own resources.

Key People

Connor Leahy (CEO, sole Director): Born 1997, self-taught, no formal degree. Co-founded EleutherAI (GPT-J, GPT-NeoX). P(doom) ~90%. One of the most recognized figures in AI safety discourse. Extremely candid communicator -- his public profile vastly exceeds the org's institutional footprint.

Gabriel Alfour (CTO, co-founder): Ex-CEO of Marigold (Tezos L2). Background in functional programming/type theory informs the programming-language approach to cognitive software. Co-founded ControlAI. Author of the "Spectre" essay criticizing the AI safety community for not being direct enough about extinction risks.

Departures (the pattern is stark): Sid Black (co-founder/CTO, resigned as director after 4 months, no public explanation). Lee Sharkey (sparse autoencoders work, left for Apollo Research then Goodfire). Beren Millidge (SAE co-author, left). Adam Shimi (epistemologist, moved to ControlAI). Chris Scammell (COO, ex-D.E. Shaw, left for Buddhism and AI Initiative). janus (Simulators author, left). Team shrank from ~20 to ~13. Virtually every named researcher from the 2022 era has departed. What remains appears to be a CEO/communicator, a CTO/architect, and a product-focused team.

Money and Incentives

Total VC funding: ~$25M seed from Nat Friedman (ex-GitHub CEO), Daniel Gross, Patrick and John Collison (Stripe), Andrej Karpathy, Arthur Breitman (Tezos), Sam Bankman-Fried (small amount, pre-FTX), Firestreak Ventures, Metaplanet Holdings (Jaan Tallinn), Plural Platform.

Open Phil/Coefficient Giving grants: $1.15M across 4 grants (2022-2025). All for hosting programs (MATS, ARENA, bootcamp) -- zero for core research. Open Phil declined to fund Conjecture's alignment research, which Leahy discussed candidly in 2022.

Revenue: Unknown. Products exist (Axiom, Cubed, tactics.dev) but no public revenue figures. The Daemon AWS case study confirms real infrastructure and a VP of Product & Commercial, suggesting genuine commercial activity.

Business model evolution: VC-funded alignment research (2022) -> admitted alignment funding insufficient (late 2022) -> spun out product arm "Lemma Labs" (late 2022) -> "all in on CoEm" with enterprise customers (2024) -> tactics.dev and cognitive software products (2025). The trajectory from alignment research org to AI product company is unmistakable.

Financial independence from labs: Conjecture uses AWS (paid), does not receive compute credits from frontier labs. This gives them genuine independence to criticize labs, which is unusual in the safety ecosystem.

Key incentive tension: VC investors want financial returns. Alignment requires potentially uncommercial research. The alignment researchers have left; the product team has grown. The incentive gradient points toward becoming a normal AI company that uses "safety" as a brand differentiator. Whether this is actually what's happening or whether products genuinely serve alignment goals is the central question about Conjecture's current identity.

What Others Say

LessWrong "Critiques" post: Conjecture is "significantly underperforming standard academic research labs relative to their $10 million in funding." Hits-based research approach doesn't meet adequate standards. Leadership is "relatively inexperienced and stretched thin."

Manifold prediction markets: 77% chance Conjecture dissolves by 2028. 57% net good for alignment. 75% net good for the world. Community expects the org to die but to have been worth something -- likely via ControlAI's policy work.

Conjecture's own self-assessment (Nov 2022): "Most of our efforts to date have not made meaningful progress on the alignment problem." Assessed each research strand as having negligible practical utility, overinvestment, or failure to cut the hard problem.

Christiano (ARC) on Alfour's core argument: Alfour argues even narrow alignment fails because aligned systems become parts of larger unaligned systems. Christiano calls this "buck-passing." This debate is unresolved and goes to the heart of whether Conjecture's pivot from technical research to policy was insight or retreat.

Alfour on the AI safety community (Feb 2026): "The Spectre" has consistently diverted resources "away from DIP-like honest approaches that help everyone." The community has been "10 years too late" on informing the public about extinction risks, favoring approaches that "avoid alienating friends in a community that is intertwined with AI companies."

What's Absent

No peer-reviewed publications demonstrating CoEm works, despite the agenda being 3 years old. No named enterprise customers or revenue figures. No public explanation for co-founder Sid Black's departure after 4 months. No statements from any of the ~7 departed key staff about why they left. No external safety audits or independent evaluations. No research collaborations with other alignment orgs since 2022. No response to the LessWrong critiques post. No disclosure of ControlAI's funding sources. The absence of empirical validation for CoEm is the most consequential gap -- an org claiming to have found the right approach to safe AI has not published evidence it works.

Recommended Reading

  1. Inside View interview with Connor Leahy (July 2022) -- 3-hour transcript. The most candid source on Conjecture's founding worldview. Leahy on doom, EleutherAI, VC vs EA funding, and why he expects to fail but tries anyway. https://theinsideview.ai/connor2

  2. Critiques of prominent AI safety labs: Conjecture -- The strongest external critique, arguing their research output is inadequate relative to funding. https://www.lesswrong.com/posts/9jvrQToSq3CYvoeHf/critiques-of-prominent-ai-safety-labs-conjecture

  3. 8-Month Retrospective (Nov 2022) -- "Most of our efforts have not made meaningful progress." The most honest self-assessment by any AI safety org. https://www.conjecture.dev/research/conjecture-a-retrospective-after-8-months-of-work

  4. Alfour on Competing Beliefs About Superintelligence (April 2025) -- Most recent leadership interview. Current strategic thinking on treaties, race dynamics, and Conjecture's evolution. https://aipolicypod.substack.com/p/16-gabe-alfour-on-competing-beliefs

  5. Cognitive Software Roadmap (Dec 2024) -- Their most complete vision document. Excellent critique of current AI development; reveals how far CoEm is from completion. https://www.conjecture.dev/research/conjecture-a-roadmap-for-cognitive-software-and-a-humanist-future-of-ai

Show Claude’s analysis
An opinionated read. Read the brief first to form your own view.

Stated Theory of Change

Conjecture has had three stated theories of change, each superseding the last:

ToC 1 (March-Nov 2022): Alignment requires a breakthrough nobody has found yet. Maximize the probability of finding it by running many decorrelated research bets in parallel. "If you need to roll high, roll many dice."

ToC 2 (Feb 2023-present): Cognitive Emulation (CoEm). Build safe AI by treating it as a software engineering problem. Instead of building opaque, end-to-end black-box systems, compose modular, auditable building blocks that emulate human-like cognition. This keeps AI in the "human regime" where existing institutions can govern it, while also being commercially valuable (enterprises want reliable AI, not "a quirky brain in a jar with a personality disorder").

ToC 3 (Nov 2023-present, via ControlAI): The AI safety community has failed to inform the public and lawmakers about extinction risks. Direct institutional engagement -- briefing lawmakers, getting public commitments to binding regulation -- is the bottleneck, not technical research or persuasion. International treaties on AI development are needed, starting with non-superpowers and creating incentives for superpowers to join.

Revealed Theory of Change

Actions tell a different story than mission statements.

Revealed priorities by resource allocation:

  • ~85% of effort appears directed at product development (Axiom, Cubed, tactics.dev, CTAC language, AWS infrastructure)
  • ~10% at public communication and policy (Leahy's media appearances, ControlAI coordination)
  • ~5% or less at alignment research (no published results since late 2022 from researchers still at Conjecture)

Revealed priorities by personnel:

  • Every named alignment researcher from the 2022 era has departed
  • Current team appears product-focused (VP of Product & Commercial, AWS infrastructure partnership)
  • ControlAI has absorbed the policy/governance talent (Miotti, Shimi)

Revealed priorities by leadership attention:

  • Leahy's public appearances increasingly focus on policy and x-risk communication
  • Alfour's most recent interview (April 2025) is about international treaties, not CoEm
  • The Cognitive Software Roadmap reads more as a philosophical manifesto than a technical progress report

The gap between stated and revealed theory of change is significant. Conjecture says it is building safe AI through Cognitive Emulation. What it appears to be doing is (a) building AI products that use safety as a brand differentiator, while (b) its leadership contributes to AI policy through ControlAI and public advocacy. Neither of these is bad, but neither is what the stated theory of change promises.

Key Assumptions

Assumption 1: CoEm is a viable approach to safe AI

  • Evidence for: The software engineering analogy is compelling. Type systems, formal methods, and modular design genuinely do make software more reliable. The Wright Brothers analogy resonates.
  • Evidence against: No published empirical results in 3 years. The researchers who might have validated it have left. Phase 5 (actual CoEm) of the roadmap is described as "speculative." The proposal itself is labeled "naive" by its own authors. Current LLM capabilities may not decompose cleanly into the primitive building blocks CoEm requires.
  • Testable: Yes. Does a CoEm system actually outperform standard approaches on reliability/auditability benchmarks? This test has not been conducted or published.
  • If wrong: Conjecture is building ordinary AI products with a safety brand, not a fundamentally different approach to AI.

Assumption 2: The world will choose not to build unbounded ASI, making CoEm relevant

  • Evidence for: Growing policy momentum (ControlAI's 112 lawmakers, House of Lords debates). Public concern about AI risks is increasing.
  • Evidence against: Every major lab is racing to build the most capable systems possible. DeepSeek, OpenAI, Anthropic, Google DeepMind are all scaling aggressively. No country has implemented binding restrictions on AI capabilities.
  • Testable: Yes, by policy outcomes over the next 2-5 years.
  • If wrong: CoEm is "a nice idea that gets steamrolled by GPT-N." Conjecture's products compete on normal business merits, not on being safer.

Assumption 3: Leahy's public profile is a durable asset rather than a liability

  • Evidence for: Genuine attention-getting. Builds awareness of AI risks. Attracts talent and investors.
  • Evidence against: "World's second most famous AI doomer" persona may alienate mainstream ML researchers and enterprise customers. P(doom) 90% creates credibility tension with running a product company.
  • If wrong: Conjecture is perceived as an ideological org rather than a serious technical effort.

Strengths

  1. Genuine intellectual honesty. The 8-month retrospective is the most candid self-assessment by any AI safety org. Publishing "most of our efforts have not made meaningful progress" requires real epistemic integrity. This culture of honest self-assessment, if it persists, is extremely valuable.

  2. ControlAI's DIP approach is demonstrably effective. 112 UK lawmakers supporting their campaign in ~1 year, House of Lords debates on AI extinction risk -- these are concrete policy outcomes. Alfour's "Spectre" critique of the safety community may be self-serving but contains real insight: most safety orgs have not been direct enough about extinction risks with policymakers.

  3. Financial independence from frontier labs. Conjecture does not depend on Anthropic, OpenAI, or Google for funding, compute, or career pipeline. This gives them genuine freedom to criticize, which they exercise.

  4. The cognitive software vision is intellectually serious. Even if CoEm is unproven, the Cognitive Software Roadmap's critique of current AI development practices is substantive. The Oracle Database analogy for current AI development is apt. The programming language approach draws on deep software engineering principles (type systems, formal methods, Unix philosophy).

  5. Early-mover advantage on sparse autoencoders. The Sharkey et al. SAE work preceded and influenced Anthropic's much more famous work. This demonstrates the team had genuine technical insight, even if those researchers have since left.

Weaknesses and Risks

  1. The talent exodus is severe. Virtually every named researcher from the founding era has left. The people who produced the most cited work are gone. What remains is a CEO/communicator, a CTO/architect, and what appears to be a product team. An alignment research org without alignment researchers is a product company with an alignment brand.

  2. No empirical validation of CoEm after 3 years. The most important thing Conjecture could do to establish credibility -- publish benchmarks showing CoEm systems are actually more reliable and auditable than standard approaches -- has not been done. This absence undermines every claim about the technical approach.

  3. Serial pivots erode credibility. Interpretability -> Simulators -> Epistemology -> CoEm -> Products. Each pivot was individually defensible, but the pattern reads as a team that keeps looking for product-market fit rather than a research org methodically working a problem.

  4. Governance concentration risk. Leahy is sole director and sole person with significant control. No board, no external oversight. If Leahy's judgment fails, there is no check.

  5. The CoEm thesis may be irrelevant if the world doesn't slow down. CoEm explicitly depends on the world making "the sensible choice to not go straight for ASI." If frontier labs keep building unbounded systems, CoEm-bounded systems may be too weak to matter commercially or for safety.

  6. The Christiano critique is unresolved. Alfour's own arguments suggest narrow technical alignment is insufficient because aligned systems become parts of larger unaligned systems. If true, this undermines Conjecture's own technical work in favor of ControlAI's policy work. The intellectual case for CoEm is weakest from Conjecture's own CTO.

Cross-References

  • ControlAI is the most important entity in Conjecture's orbit. It may end up being the lasting contribution of the Conjecture ecosystem, even if Conjecture itself dissolves (as Manifold's 77% probability suggests).
  • EleutherAI is the origin community. Conjecture represents the "alignment branch" of EleutherAI, while Stability AI represents the "capabilities branch." Both emerged from the same community with very different outcomes.
  • Anthropic had its sparse autoencoders work preceded by Conjecture's SAE interim report. This is the clearest evidence of Conjecture contributing to the broader technical alignment field.
  • ARC (Alignment Research Center) -- the Christiano-Alfour debate is a key intellectual document. Christiano's "buck-passing" critique is the strongest technical argument against Conjecture's trajectory.
  • PauseAI and AI advocacy orgs -- ControlAI is described as the "inside game" strategy (direct lawmaker engagement) vs. PauseAI's "outside game" (public demonstrations). They are complementary, not competing.

What Would Change This Assessment

  • Published CoEm benchmarks showing genuine improvement over standard approaches would be the single biggest positive update. If CoEm systems demonstrably outperform standard LLM approaches on reliability/auditability in controlled evaluations, the entire assessment shifts.
  • Significant new funding round or revenue milestone would indicate external validation of the product direction.
  • Successful international AI treaty with ControlAI involvement would validate the policy theory of change.
  • Return of alignment researchers or significant new technical hires would indicate the research direction is alive, not abandoned.
  • Conjecture dissolving with ControlAI continuing would confirm the Manifold thesis that the policy arm is the lasting contribution.

Self-Critique

What sources should I have checked but didn't:

  • The Compendium (100+ page PDF, not fetched). Contains the most complete argument for extinction risk from the Conjecture team.
  • Full debate transcripts (Leahy vs. Beff Jezos, Leahy vs. George Hotz) -- only available as video/audio.
  • The original Conjecture AMA on Alignment Forum (founding Q&A).
  • Full text of the LessWrong critiques post (only search summary available).

Where is this analysis potentially biased:

  • I may be reading the talent exodus too negatively. In a fast-moving field, researchers move frequently, and departures don't necessarily indicate organizational failure.
  • The absence of published CoEm results may reflect legitimate infohazard concerns rather than lack of progress -- Conjecture's stated policy is to keep things secret by default.
  • I may be underweighting ControlAI's importance by framing it as a "spinoff" rather than as the primary vehicle through which the Conjecture ecosystem creates value.

What would a thoughtful person who disagrees say: "Conjecture correctly identified that the bottleneck is not technical alignment research but policy and governance. They pivoted accordingly -- ControlAI for policy, Conjecture for products that demonstrate what safe AI looks like in practice. The researcher departures reflect normal career mobility, not organizational failure. CoEm is unpublished because it's commercially sensitive, not because it doesn't work. Leahy and Alfour are doing exactly the right thing by building both a business case and a policy case for safe AI, rather than publishing papers that get cited but change nothing."

My single weakest claim: That the talent exodus indicates organizational dysfunction rather than normal career mobility. I have no inside knowledge and no statements from departed staff. The inference is pattern-based, not evidence-based.

What information would most change my view: Published, peer-reviewed benchmarks showing CoEm systems meaningfully outperforming standard approaches on reliability and auditability metrics. This would resolve the central uncertainty about whether Conjecture is building something genuinely new or repackaging standard AI engineering with safety branding.

Connected to (9)

Daemoncollaborator
Goodfirestaff to · Lee Sharkey
Apollo Researchstaff to · Lee Sharkey
ControlAIspun off from · Gabriel Alfour
Alignment Research Centercollaborator · Paul Christiano
Anthropiccollaborator
EleutherAIspun off from · Connor Leahy
MATScollaborator
Stability AIcollaborator
Sources (47)
Every URL that was read during research.
  1. 1.Connor Leahy - Wikipediaen.wikipedia.org
  2. 2.Conjectureconjecture.dev
  3. 3.Conjectureconjecture.dev
  4. 4.Conjectureconjecture.dev
  5. 5.'We are super, super fucked': Meet the man trying to stop an AI apocalypsesifted.eu
  6. 6.Researcher: To Stop AI Killing Us, First Regulate Deepfakestime.com
  7. 7.About Us | ControlAIcontrolai.com
  8. 8.A Narrow Pathcontrolai.news
  9. 9.Conjecture: a retrospective after 8 months of work - Conjectureconjecture.dev
  10. 10.Conjecture: A Roadmap for Cognitive Software and A Humanist Future of AI - Conjectureconjecture.dev
  11. 11.Conjecture: 2 Years - Conjectureconjecture.dev
  12. 12.Cognitive Emulation: A Naive AI Safety Proposal - Conjectureconjecture.dev
  13. 13.Connor Leahy on Dignity and Conjecturetheinsideview.ai
  14. 14.🔮 Does AI present an existential risk?exponentialview.co
  15. 15.177 - AI is a Ticking Time Bomb with Connor Leahy on Banklessbankless.com
  16. 16.The Direct Institutional Plan | ControlAIcontrolai.com
  17. 17.Connor Leahy - People in AI - AI Blogartificial-intelligence.blog
  18. 18.Conjectureconjecture.dev
  19. 19.Connor Leahy – Biography, Net Worth, Career, Companies & Lifestyle [2026]eboona.com
  20. 20.Connor Leahy (EleutherAI/GPT-J) interview transcriptslifearchitect.ai
  21. 21.Asking Big Tech to police AI is like turning to ‘oil companies to solve climate change,’ AI researcher says | Fortunefortune.com
  22. 22.Sunak’s global AI safety summit risks achieving very little, warns tech bosstheguardian.com
  23. 23.EleutherAI - Wikipediaen.wikipedia.org
  24. 24.Avoiding Extinction with Andrea Miotti and Connor Leahycontrolai.news
  25. 25.Conjectureconjecture.dev
  26. 26.Conjecture - 2026 Company Profile & Team - Tracxntracxn.com
  27. 27.A brief guide to the groups protesting over AItransformernews.ai
  28. 28.Andrea Miotti - A Human-First AI Future (AGI Governance, Episode 4) - Daniel Faggelladanfaggella.com
  29. 29.Christiano (ARC) and GA (Conjecture) Discuss Alignment Cruxesconjecture.dev
  30. 30.Sparse Autoencoders Find Highly Interpretable Features in Language Modelsarxiv.org
  31. 31.A theory of neural emulatorsarxiv.org
  32. 32.Introductionnarrowpath.co
  33. 33.Will Conjecture produce work that they believe constitutes meaningful progress towards alignment by the end of 2023?manifold.markets
  34. 34.Connor Leahyprogressive.org
  35. 35.Memes tell the story of a secret war in tech. It's no jokeabc.net.au
  36. 36.AI Security Bootcampaisb.dev
  37. 37.Unknownberen.io
  38. 38.Podcaststhecompendium.ai
  39. 39.#16: Gabe Alfour on Competing Beliefs About Superintelligenceaipolicypod.substack.com
  40. 40.CONJECTURE LTDfind-and-update.company-information.service.gov.uk
  41. 41.CONJECTURE LTDfind-and-update.company-information.service.gov.uk
  42. 42.Conjecture Ltdopengovuk.com
  43. 43.List of p(doom) valuespauseai.info
  44. 44.Connor Leahy: We lack understanding of intelligence and neural networks, the unpredictability of AI could lead to loss of control, and GPT models have revolutionized AI capabilities | The Peter McCormack Showcryptobriefing.com
  45. 45.The Spectre haunting the "AI Safety" Communitycognition.cafe
  46. 46.Daemon brings AI startup Conjecture's AI inference ecosystem into AWSdae.mn
  47. 47.Conjecture | Competitive Intelligence Profilerivalsense.co