Theory of Change
Ought was founded in 2017 with a theory of change rooted in factored cognition: the idea that complex reasoning tasks can be decomposed into smaller sub-tasks that are independently evaluable, enabling humans to supervise AI systems even when those systems exceed human capability on the full task. This drew directly from Paul Christiano's Iterated Distillation and Amplification framework.
Andreas Stuhlmuller, from a 2020 EA talk: "Can you somehow assign a mechanism that arranges your interaction with [AI] experts, such that they try to be as helpful to you as an expert who intrinsically cares about you? That's the problem."
The theory evolved in 2022 into a broader framework: process-based vs. outcome-based ML systems. Process-based systems are transparent, decomposed, human-supervisable. Outcome-based systems are end-to-end optimized, opaque, and prone to gaming metrics. The argument: process-based systems have (1) better differential capabilities for tasks without clear outcome metrics, (2) better alignment properties long-term, and (3) both architectures are "attractors" with lock-in effects, so pushing for process now matters.
The practical instantiation is Elicit, an AI research assistant. The "alignment windfall" framing (articulated by James Brady, Head of Eng): "For Elicit, creating a valuable product is the same thing as building a truthful, transparent system. Trustworthiness is our value proposition." If Elicit succeeds commercially using process-based architectures, it becomes an "existence proof" that others copy -- shifting the landscape toward more alignable approaches.
What They Do
2017-2020 (research phase): Human experiments on factored cognition. Participants given small context windows to solve tasks cooperatively. Key finding: error propagation is the critical challenge -- you need error correction mechanisms that are themselves composable from small pieces.
2021-2023 (product launch): Elicit launched as an AI research assistant for literature review. Users ask research questions, receive structured tables of papers with AI-extracted data (population, intervention, outcomes, limitations). Each answer linked to specific paper passages. 200K monthly users by September 2023.
Sept 2023-present (PBC era): Elicit spun off as a Public Benefit Corporation. Raised $9M seed (Fifty Years), then $22M Series A at $100M valuation (Spark Capital + Footwork, Feb 2025). 400K+ monthly active users. Estimated $18-22M ARR. Shipped 1.4 features per week on average. Major launches: Notebooks, Research Agents, Systematic Reviews, Reports.
Technical approach: Multi-model ensemble (fine-tuned Flan-T5, GPT-3.5/4, Claude). Hybrid semantic + lexical search with multi-stage re-ranking. Chain of thought for all extractions. Factored verification to detect hallucinations (published at WIESP 2023 -- reduced hallucinations by ~40-50% via self-correction). Constitutional AI for training data generation.
Research output: Publication trajectory shifted from theoretical (factored cognition experiments) to applied (factored verification, iterated decomposition) to product-oriented. Key papers: Iterated Decomposition (2023, improved science Q&A from 25% to 65% accuracy), Factored Verification (2023), Supervise Process not Outcomes (2022), RAFT benchmark (NeurIPS 2021).
Open source: ICE (Interactive Composition Explorer) for compositional LM programs. Factored Cognition Primer tutorial.
Key People
Andreas Stuhlmuller -- CEO and co-founder. PhD Cognitive Science MIT (Tenenbaum), postdoc Stanford (Goodman). 20+ publications in probabilistic programming and computational cognitive science. Has been thinking about automating reasoning since his teens. Compensation at Ought was $120-167K -- well below market.
Jungwon Byun -- COO and co-founder. BA Economics Yale. Former Head of Growth at Upstart (helped scale 5x). Brings operational and growth expertise. Unusually effective at communicating the technical vision.
Owain Evans -- Board member and CEO of Ought (as of FY2025). Published TruthfulQA, Emergent Misalignment (Nature 2026), Reversal Curse. Runs Truthful AI research group in Berkeley. World-class AI safety researcher whose continued involvement adds significant credibility.
Notable departure: William Saunders, early Ought software engineer, co-authored key papers, later joined OpenAI alignment team, then resigned Feb 2024 citing safety concerns. Said he asked himself if OpenAI was "more like the Apollo program or more like the Titanic."
Team: ~25-30 at Elicit. Mix of EA-adjacent (GiveWell), academic (PhD philosophy, PhD causality/safety), and strong tech (Stripe, Square, Figma, Google) backgrounds.
Money and Incentives
Ought (nonprofit) -- ~$9M total grants:
- Open Philanthropy: $3.1M (2018-2020, "general support" under "Navigating Transformative AI")
- FTX Future Fund: ~$5M (mid-2022, for "building Elicit" -- disbursed before FTX collapse)
- Survival and Flourishing Fund: ~$800K
- EA Funds: ~$60K
Elicit (PBC) -- $31M VC + growing product revenue:
- $9M seed (Sept 2023, Fifty Years led)
- $22M Series A (Feb 2025, Spark Capital + Footwork, $100M valuation)
- Angel investors: Jeff Dean (Google), Tom Preston-Werner (GitHub), Thomas Ebeling (ex-Novartis CEO)
- Estimated $18-22M ARR in 2025; $1M ARR within 4 months of launching paid tier
The transition: Ought sold its IP to Elicit PBC at an independent valuation (~$8.2M per FY2025 990). Ought now holds ~$10M in Elicit equity at carrying value. Zero employees, zero operations. The nonprofit is effectively a dormant equity holding company that could reactivate if Elicit returns profits.
Incentive alignment (claimed): Accuracy IS the value proposition. Building trustworthy AI IS building a good product. No lab funding, no compute dependency, no board overlap with frontier labs. This is the cleanest financial structure of any org claiming to do both commercial AI and safety work.
Incentive concerns (structural): As a VC-backed company, Elicit faces growth pressure. If end-to-end approaches become more accurate than process-based decomposition for certain tasks, will Elicit maintain its commitment to transparency, or will it adopt opaque methods to stay competitive? The PBC structure provides some protection, but ultimately, a PBC board can redefine its benefit obligations.
What Others Say
Vaniver (AI Alignment Forum): Factored cognition has a tree structure limitation -- horizontal communication between branches is impossible, leading to duplicated effort. Additionally, there is a safety-generality tradeoff: FC "allows you to stick to cognitive strategies that definitely solve a problem in a safe way" but may not allow developing new strategies without opening the door to inner optimizers.
Former Ought employee (Eli, EA Forum): Considers the theory of change "very conjunctive" -- process-based systems must be competitive with end-to-end training for AGI, which may be unlikely. Also flags uncertainty about "whether generally speeding up science is good or bad."
Andreas himself (acknowledging the "bitter lesson"): "The history of machine learning is the bitter lesson of outcomes winning. Vision and NLP started with more structured systems, which were replaced with end-to-end systems." He argues this pattern may not hold for high-stakes reasoning where transparency matters, but acknowledges this is "up in the air."
Jungwon (on RLHF calibration): "A lot of the reinforcement learning is causing the models to be incredibly uncalibrated and overconfident in their answers. And so that is preventing us from being able to use these models in particular ways." A candid acknowledgment of dependence on upstream model properties Elicit cannot control.
Footwork Ventures (investor, so discount accordingly): "Perhaps no company's story exemplifies [lifelong commitment] more so than Elicit's." Cites "400,000 monthly active users, tens of thousands of paying subscribers, and millions in annual revenue" growing through word-of-mouth.
What's Absent
- No sustained external critique of the "alignment windfall" framework or Elicit's specific safety claims. The safety community has not engaged seriously with whether Elicit is safety-positive, safety-neutral, or safety-negative.
- No 80,000 Hours podcast appearance -- unusual for an EA-funded org with this profile.
- No mechanism to differentially accelerate safety research over capabilities research. Elicit is domain-agnostic.
- No independent accuracy audit -- accuracy claims are self-reported.
- IP sale terms remain opaque -- equity percentage, liquidation preferences, and acquisition scenarios for Ought's stake are undisclosed.
- Paul Christiano's departure from the board (May 2024) has no public explanation, though a benign reading (Ought has minimal operations) is plausible.
- Owen Cotton-Barratt remains on Ought's board despite his 2023 EA community controversy (resigned from all other EA positions due to sexual misconduct investigation).
- No evidence that AI safety researchers are a significant user segment of Elicit, despite the safety claims.
Recommended Reading
2023 Cognitive Revolution podcast (most candid source) -- Andreas and Jungwon discuss the full arc from founding experiments to product. Honest about what didn't work. Nathan Labenz is a good interlocutor but is a disclosed investor. https://www.cognitiverevolution.ai/the-ai-reasoning-revolution-with-oughts-jungwon-byun-and-andreas-stuhlmuller/
Ought's Theory of Change (EA Forum) -- Community discussion including a former employee's critique that the theory is "very conjunctive" and questioning whether speeding up science is good or bad. https://forum.effectivealtruism.org/posts/raFAKyw7ofSo9mRQ3/ought-s-theory-of-change
"Supervise Process, not Outcomes" -- The core theoretical document. Clear, honest about limitations, acknowledges the bitter lesson. https://ought.org/updates/2022-04-06-process
"Discovering Alignment Windfalls Reduces AI Risk" -- The strategic argument that Elicit is AI safety work. The most explicit articulation of the theory of change. https://elicit.com/blog/alignment-windfalls
Vaniver's View on Factored Cognition -- Most substantive technical critique. https://www.alignmentforum.org/posts/J7Rnt8aJPH7MALkmq/vaniver-s-view-on-factored-cognition