Conjecture

Empirical Research

Cognitive emulation. Controversial.

Founded: 2022
HQ: London, UK
Team: 13
Structure: limited company (UK)
Model: Vc Investment

Theory of Change

Conjecture's theory of change has shifted substantially since founding. The current version:

Current (2024-present): AI development is "algorithmic carcinogenesis" -- capabilities growing without understanding, producing uncontrollable complexity. The solution is to treat AI as a software engineering problem. Conjecture builds "Cognitive Software" -- modular, auditable systems where LLMs are used as components within human-understandable architectures, not as autonomous black boxes. Their Wright Brothers analogy: "We're building 'airplanes' not 'engines.'" The airplane (safe architecture around the LLM engine) is what makes flight possible without killing the pilot.

Their CoEm (Cognitive Emulation) agenda proposes: "build predictably boundable systems, not directly aligned AGIs." Break cognitive tasks into primitive building blocks, compose them modularly, minimize black-box computation, keep systems in the "human regime" where existing institutions can govern them.

Original (March 2022): "If you need to roll high, roll many dice" -- solve alignment by scaling decorrelated research bets. Leahy: "I don't expect any of the current methods to work... we have to go through at least one crazy new breakthrough." This vision was abandoned after 8 months when they admitted none of the approaches had worked.

Parallel track (2023-present): Via the spinoff ControlAI, a Direct Institutional Plan to inform lawmakers about extinction risks and push for binding international regulation on advanced AI development.

What They Do

Products: Axiom (low-latency voice chat API), Cubed/Hypercubed (coding assistant), tactics.dev (AI workflow deployment platform), CTAC (custom cognitive programming language). All being consolidated into a "cognitive emulation platform." AWS infrastructure deployed via Daemon consulting partnership. No public information on customers or revenue.

Policy (via ControlAI): Direct Institutional Plan -- cold-emailing lawmakers, briefing them on extinction risks, getting public commitments to binding regulation. Claims 112 UK lawmakers supporting their campaign. Led two debates in UK House of Lords on AI extinction risk (Jan 2026). "A Narrow Path" and MAGIC proposals for international AI governance institutions.

Research output (declining): Polytope Lens paper (2022, self-assessed as negligible utility), Simulators post by janus (2022, popular but not useful per experienced researchers), sparse autoencoders interim report (Dec 2022 -- preceded and influenced Anthropic's SAE work, arguably their most impactful research contribution), Cognitive Emulation proposal (Feb 2023), Cognitive Software Roadmap (Dec 2024). The researchers who produced the most cited work have since departed.

Field-building: Hosted SERI MATS London cohort, ARENA program, AI Security Bootcamp. All funded by Open Phil grants ($1.15M total), not Conjecture's own resources.

Key People

Connor Leahy (CEO, sole Director): Born 1997, self-taught, no formal degree. Co-founded EleutherAI (GPT-J, GPT-NeoX). P(doom) ~90%. One of the most recognized figures in AI safety discourse. Extremely candid communicator -- his public profile vastly exceeds the org's institutional footprint.

Gabriel Alfour (CTO, co-founder): Ex-CEO of Marigold (Tezos L2). Background in functional programming/type theory informs the programming-language approach to cognitive software. Co-founded ControlAI. Author of the "Spectre" essay criticizing the AI safety community for not being direct enough about extinction risks.

Departures (the pattern is stark): Sid Black (co-founder/CTO, resigned as director after 4 months, no public explanation). Lee Sharkey (sparse autoencoders work, left for Apollo Research then Goodfire). Beren Millidge (SAE co-author, left). Adam Shimi (epistemologist, moved to ControlAI). Chris Scammell (COO, ex-D.E. Shaw, left for Buddhism and AI Initiative). janus (Simulators author, left). Team shrank from ~20 to ~13. Virtually every named researcher from the 2022 era has departed. What remains appears to be a CEO/communicator, a CTO/architect, and a product-focused team.

Money and Incentives

Total VC funding: ~$25M seed from Nat Friedman (ex-GitHub CEO), Daniel Gross, Patrick and John Collison (Stripe), Andrej Karpathy, Arthur Breitman (Tezos), Sam Bankman-Fried (small amount, pre-FTX), Firestreak Ventures, Metaplanet Holdings (Jaan Tallinn), Plural Platform.

Open Phil/Coefficient Giving grants: $1.15M across 4 grants (2022-2025). All for hosting programs (MATS, ARENA, bootcamp) -- zero for core research. Open Phil declined to fund Conjecture's alignment research, which Leahy discussed candidly in 2022.

Revenue: Unknown. Products exist (Axiom, Cubed, tactics.dev) but no public revenue figures. The Daemon AWS case study confirms real infrastructure and a VP of Product & Commercial, suggesting genuine commercial activity.

Business model evolution: VC-funded alignment research (2022) -> admitted alignment funding insufficient (late 2022) -> spun out product arm "Lemma Labs" (late 2022) -> "all in on CoEm" with enterprise customers (2024) -> tactics.dev and cognitive software products (2025). The trajectory from alignment research org to AI product company is unmistakable.

Financial independence from labs: Conjecture uses AWS (paid), does not receive compute credits from frontier labs. This gives them genuine independence to criticize labs, which is unusual in the safety ecosystem.

Key incentive tension: VC investors want financial returns. Alignment requires potentially uncommercial research. The alignment researchers have left; the product team has grown. The incentive gradient points toward becoming a normal AI company that uses "safety" as a brand differentiator. Whether this is actually what's happening or whether products genuinely serve alignment goals is the central question about Conjecture's current identity.

What Others Say

LessWrong "Critiques" post: Conjecture is "significantly underperforming standard academic research labs relative to their $10 million in funding." Hits-based research approach doesn't meet adequate standards. Leadership is "relatively inexperienced and stretched thin."

Manifold prediction markets: 77% chance Conjecture dissolves by 2028. 57% net good for alignment. 75% net good for the world. Community expects the org to die but to have been worth something -- likely via ControlAI's policy work.

Conjecture's own self-assessment (Nov 2022): "Most of our efforts to date have not made meaningful progress on the alignment problem." Assessed each research strand as having negligible practical utility, overinvestment, or failure to cut the hard problem.

Christiano (ARC) on Alfour's core argument: Alfour argues even narrow alignment fails because aligned systems become parts of larger unaligned systems. Christiano calls this "buck-passing." This debate is unresolved and goes to the heart of whether Conjecture's pivot from technical research to policy was insight or retreat.

Alfour on the AI safety community (Feb 2026): "The Spectre" has consistently diverted resources "away from DIP-like honest approaches that help everyone." The community has been "10 years too late" on informing the public about extinction risks, favoring approaches that "avoid alienating friends in a community that is intertwined with AI companies."

What's Absent

No peer-reviewed publications demonstrating CoEm works, despite the agenda being 3 years old. No named enterprise customers or revenue figures. No public explanation for co-founder Sid Black's departure after 4 months. No statements from any of the ~7 departed key staff about why they left. No external safety audits or independent evaluations. No research collaborations with other alignment orgs since 2022. No response to the LessWrong critiques post. No disclosure of ControlAI's funding sources. The absence of empirical validation for CoEm is the most consequential gap -- an org claiming to have found the right approach to safe AI has not published evidence it works.

Stated Theory of Change

Conjecture has had three stated theories of change, each superseding the last:

ToC 1 (March-Nov 2022): Alignment requires a breakthrough nobody has found yet. Maximize the probability of finding it by running many decorrelated research bets in parallel. "If you need to roll high, roll many dice."

ToC 2 (Feb 2023-present): Cognitive Emulation (CoEm). Build safe AI by treating it as a software engineering problem. Instead of building opaque, end-to-end black-box systems, compose modular, auditable building blocks that emulate human-like cognition. This keeps AI in the "human regime" where existing institutions can govern it, while also being commercially valuable (enterprises want reliable AI, not "a quirky brain in a jar with a personality disorder").

ToC 3 (Nov 2023-present, via ControlAI): The AI safety community has failed to inform the public and lawmakers about extinction risks. Direct institutional engagement -- briefing lawmakers, getting public commitments to binding regulation -- is the bottleneck, not technical research or persuasion. International treaties on AI development are needed, starting with non-superpowers and creating incentives for superpowers to join.

Revealed Theory of Change

Actions tell a different story than mission statements.

Revealed priorities by resource allocation:

~85% of effort appears directed at product development (Axiom, Cubed, tactics.dev, CTAC language, AWS infrastructure)
~10% at public communication and policy (Leahy's media appearances, ControlAI coordination)
~5% or less at alignment research (no published results since late 2022 from researchers still at Conjecture)

Revealed priorities by personnel:

Every named alignment researcher from the 2022 era has departed
Current team appears product-focused (VP of Product & Commercial, AWS infrastructure partnership)
ControlAI has absorbed the policy/governance talent (Miotti, Shimi)

Revealed priorities by leadership attention:

Leahy's public appearances increasingly focus on policy and x-risk communication
Alfour's most recent interview (April 2025) is about international treaties, not CoEm
The Cognitive Software Roadmap reads more as a philosophical manifesto than a technical progress report

The gap between stated and revealed theory of change is significant. Conjecture says it is building safe AI through Cognitive Emulation. What it appears to be doing is (a) building AI products that use safety as a brand differentiator, while (b) its leadership contributes to AI policy through ControlAI and public advocacy. Neither of these is bad, but neither is what the stated theory of change promises.

Key Assumptions

Assumption 1: CoEm is a viable approach to safe AI

Evidence for: The software engineering analogy is compelling. Type systems, formal methods, and modular design genuinely do make software more reliable. The Wright Brothers analogy resonates.
Evidence against: No published empirical results in 3 years. The researchers who might have validated it have left. Phase 5 (actual CoEm) of the roadmap is described as "speculative." The proposal itself is labeled "naive" by its own authors. Current LLM capabilities may not decompose cleanly into the primitive building blocks CoEm requires.
Testable: Yes. Does a CoEm system actually outperform standard approaches on reliability/auditability benchmarks? This test has not been conducted or published.
If wrong: Conjecture is building ordinary AI products with a safety brand, not a fundamentally different approach to AI.

Assumption 2: The world will choose not to build unbounded ASI, making CoEm relevant

Evidence for: Growing policy momentum (ControlAI's 112 lawmakers, House of Lords debates). Public concern about AI risks is increasing.
Evidence against: Every major lab is racing to build the most capable systems possible. DeepSeek, OpenAI, Anthropic, Google DeepMind are all scaling aggressively. No country has implemented binding restrictions on AI capabilities.
Testable: Yes, by policy outcomes over the next 2-5 years.
If wrong: CoEm is "a nice idea that gets steamrolled by GPT-N." Conjecture's products compete on normal business merits, not on being safer.

Assumption 3: Leahy's public profile is a durable asset rather than a liability

Evidence for: Genuine attention-getting. Builds awareness of AI risks. Attracts talent and investors.
Evidence against: "World's second most famous AI doomer" persona may alienate mainstream ML researchers and enterprise customers. P(doom) 90% creates credibility tension with running a product company.
If wrong: Conjecture is perceived as an ideological org rather than a serious technical effort.

Strengths

Genuine intellectual honesty. The 8-month retrospective is the most candid self-assessment by any AI safety org. Publishing "most of our efforts have not made meaningful progress" requires real epistemic integrity. This culture of honest self-assessment, if it persists, is extremely valuable.
ControlAI's DIP approach is demonstrably effective. 112 UK lawmakers supporting their campaign in ~1 year, House of Lords debates on AI extinction risk -- these are concrete policy outcomes. Alfour's "Spectre" critique of the safety community may be self-serving but contains real insight: most safety orgs have not been direct enough about extinction risks with policymakers.
Financial independence from frontier labs. Conjecture does not depend on Anthropic, OpenAI, or Google for funding, compute, or career pipeline. This gives them genuine freedom to criticize, which they exercise.
The cognitive software vision is intellectually serious. Even if CoEm is unproven, the Cognitive Software Roadmap's critique of current AI development practices is substantive. The Oracle Database analogy for current AI development is apt. The programming language approach draws on deep software engineering principles (type systems, formal methods, Unix philosophy).
Early-mover advantage on sparse autoencoders. The Sharkey et al. SAE work preceded and influenced Anthropic's much more famous work. This demonstrates the team had genuine technical insight, even if those researchers have since left.

Weaknesses and Risks

The talent exodus is severe. Virtually every named researcher from the founding era has left. The people who produced the most cited work are gone. What remains is a CEO/communicator, a CTO/architect, and what appears to be a product team. An alignment research org without alignment researchers is a product company with an alignment brand.
No empirical validation of CoEm after 3 years. The most important thing Conjecture could do to establish credibility -- publish benchmarks showing CoEm systems are actually more reliable and auditable than standard approaches -- has not been done. This absence undermines every claim about the technical approach.
Serial pivots erode credibility. Interpretability -> Simulators -> Epistemology -> CoEm -> Products. Each pivot was individually defensible, but the pattern reads as a team that keeps looking for product-market fit rather than a research org methodically working a problem.
Governance concentration risk. Leahy is sole director and sole person with significant control. No board, no external oversight. If Leahy's judgment fails, there is no check.
The CoEm thesis may be irrelevant if the world doesn't slow down. CoEm explicitly depends on the world making "the sensible choice to not go straight for ASI." If frontier labs keep building unbounded systems, CoEm-bounded systems may be too weak to matter commercially or for safety.
The Christiano critique is unresolved. Alfour's own arguments suggest narrow technical alignment is insufficient because aligned systems become parts of larger unaligned systems. If true, this undermines Conjecture's own technical work in favor of ControlAI's policy work. The intellectual case for CoEm is weakest from Conjecture's own CTO.

Cross-References

ControlAI is the most important entity in Conjecture's orbit. It may end up being the lasting contribution of the Conjecture ecosystem, even if Conjecture itself dissolves (as Manifold's 77% probability suggests).
EleutherAI is the origin community. Conjecture represents the "alignment branch" of EleutherAI, while Stability AI represents the "capabilities branch." Both emerged from the same community with very different outcomes.
Anthropic had its sparse autoencoders work preceded by Conjecture's SAE interim report. This is the clearest evidence of Conjecture contributing to the broader technical alignment field.
ARC (Alignment Research Center) -- the Christiano-Alfour debate is a key intellectual document. Christiano's "buck-passing" critique is the strongest technical argument against Conjecture's trajectory.
PauseAI and AI advocacy orgs -- ControlAI is described as the "inside game" strategy (direct lawmaker engagement) vs. PauseAI's "outside game" (public demonstrations). They are complementary, not competing.

What Would Change This Assessment

Published CoEm benchmarks showing genuine improvement over standard approaches would be the single biggest positive update. If CoEm systems demonstrably outperform standard LLM approaches on reliability/auditability in controlled evaluations, the entire assessment shifts.
Significant new funding round or revenue milestone would indicate external validation of the product direction.
Successful international AI treaty with ControlAI involvement would validate the policy theory of change.
Return of alignment researchers or significant new technical hires would indicate the research direction is alive, not abandoned.
Conjecture dissolving with ControlAI continuing would confirm the Manifold thesis that the policy arm is the lasting contribution.

Self-Critique

What sources should I have checked but didn't:

The Compendium (100+ page PDF, not fetched). Contains the most complete argument for extinction risk from the Conjecture team.
Full debate transcripts (Leahy vs. Beff Jezos, Leahy vs. George Hotz) -- only available as video/audio.
The original Conjecture AMA on Alignment Forum (founding Q&A).
Full text of the LessWrong critiques post (only search summary available).

Where is this analysis potentially biased:

I may be reading the talent exodus too negatively. In a fast-moving field, researchers move frequently, and departures don't necessarily indicate organizational failure.
The absence of published CoEm results may reflect legitimate infohazard concerns rather than lack of progress -- Conjecture's stated policy is to keep things secret by default.
I may be underweighting ControlAI's importance by framing it as a "spinoff" rather than as the primary vehicle through which the Conjecture ecosystem creates value.

What would a thoughtful person who disagrees say: "Conjecture correctly identified that the bottleneck is not technical alignment research but policy and governance. They pivoted accordingly -- ControlAI for policy, Conjecture for products that demonstrate what safe AI looks like in practice. The researcher departures reflect normal career mobility, not organizational failure. CoEm is unpublished because it's commercially sensitive, not because it doesn't work. Leahy and Alfour are doing exactly the right thing by building both a business case and a policy case for safe AI, rather than publishing papers that get cited but change nothing."

My single weakest claim: That the talent exodus indicates organizational dysfunction rather than normal career mobility. I have no inside knowledge and no statements from departed staff. The inference is pattern-based, not evidence-based.

What information would most change my view: Published, peer-reviewed benchmarks showing CoEm systems meaningfully outperforming standard approaches on reliability and auditability metrics. This would resolve the central uncertainty about whether Conjecture is building something genuinely new or repackaging standard AI engineering with safety branding.

Connected to (9)

Daemoncollaborator

Goodfirestaff to · Lee Sharkey

Apollo Researchstaff to · Lee Sharkey

ControlAIspun off from · Gabriel Alfour

Alignment Research Centercollaborator · Paul Christiano

Anthropiccollaborator

EleutherAIspun off from · Connor Leahy

MATScollaborator

Stability AIcollaborator

Sources (47)

Every URL that was read during research.

1.Connor Leahy - Wikipediaen.wikipedia.org
2.Conjectureconjecture.dev
3.Conjectureconjecture.dev
4.Conjectureconjecture.dev
5.'We are super, super fucked': Meet the man trying to stop an AI apocalypsesifted.eu
6.Researcher: To Stop AI Killing Us, First Regulate Deepfakestime.com
7.About Us | ControlAIcontrolai.com
8.A Narrow Pathcontrolai.news
9.Conjecture: a retrospective after 8 months of work - Conjectureconjecture.dev
10.Conjecture: A Roadmap for Cognitive Software and A Humanist Future of AI - Conjectureconjecture.dev
11.Conjecture: 2 Years - Conjectureconjecture.dev
12.Cognitive Emulation: A Naive AI Safety Proposal - Conjectureconjecture.dev
13.Connor Leahy on Dignity and Conjecturetheinsideview.ai
14.🔮 Does AI present an existential risk?exponentialview.co
15.177 - AI is a Ticking Time Bomb with Connor Leahy on Banklessbankless.com
16.The Direct Institutional Plan | ControlAIcontrolai.com
17.Connor Leahy - People in AI - AI Blogartificial-intelligence.blog
18.Conjectureconjecture.dev
19.Connor Leahy – Biography, Net Worth, Career, Companies & Lifestyle [2026]eboona.com
20.Connor Leahy (EleutherAI/GPT-J) interview transcriptslifearchitect.ai
21.Asking Big Tech to police AI is like turning to ‘oil companies to solve climate change,’ AI researcher says | Fortunefortune.com
22.Sunak’s global AI safety summit risks achieving very little, warns tech bosstheguardian.com
23.EleutherAI - Wikipediaen.wikipedia.org
24.Avoiding Extinction with Andrea Miotti and Connor Leahycontrolai.news
25.Conjectureconjecture.dev
26.Conjecture - 2026 Company Profile & Team - Tracxntracxn.com
27.A brief guide to the groups protesting over AItransformernews.ai
28.Andrea Miotti - A Human-First AI Future (AGI Governance, Episode 4) - Daniel Faggelladanfaggella.com
29.Christiano (ARC) and GA (Conjecture) Discuss Alignment Cruxesconjecture.dev
30.Sparse Autoencoders Find Highly Interpretable Features in Language Modelsarxiv.org
31.A theory of neural emulatorsarxiv.org
32.Introductionnarrowpath.co
33.Will Conjecture produce work that they believe constitutes meaningful progress towards alignment by the end of 2023?manifold.markets
34.Connor Leahyprogressive.org
35.Memes tell the story of a secret war in tech. It's no jokeabc.net.au
36.AI Security Bootcampaisb.dev
37.Unknownberen.io
38.Podcaststhecompendium.ai
39.#16: Gabe Alfour on Competing Beliefs About Superintelligenceaipolicypod.substack.com
40.CONJECTURE LTDfind-and-update.company-information.service.gov.uk
41.CONJECTURE LTDfind-and-update.company-information.service.gov.uk
42.Conjecture Ltdopengovuk.com
43.List of p(doom) valuespauseai.info
44.Connor Leahy: We lack understanding of intelligence and neural networks, the unpredictability of AI could lead to loss of control, and GPT models have revolutionized AI capabilities | The Peter McCormack Showcryptobriefing.com
45.The Spectre haunting the "AI Safety" Communitycognition.cafe
46.Daemon brings AI startup Conjecture's AI inference ecosystem into AWSdae.mn
47.Conjecture | Competitive Intelligence Profilerivalsense.co