← AI Safety Orgs

Ought / Elicit

Research

Research → product pivot (Elicit).

Founded
2017
HQ
Covina, CA
Team
0
Structure
501(c)(3) nonprofit
Model
Grants

Theory of Change

Ought was founded in 2017 with a theory of change rooted in factored cognition: the idea that complex reasoning tasks can be decomposed into smaller sub-tasks that are independently evaluable, enabling humans to supervise AI systems even when those systems exceed human capability on the full task. This drew directly from Paul Christiano's Iterated Distillation and Amplification framework.

Andreas Stuhlmuller, from a 2020 EA talk: "Can you somehow assign a mechanism that arranges your interaction with [AI] experts, such that they try to be as helpful to you as an expert who intrinsically cares about you? That's the problem."

The theory evolved in 2022 into a broader framework: process-based vs. outcome-based ML systems. Process-based systems are transparent, decomposed, human-supervisable. Outcome-based systems are end-to-end optimized, opaque, and prone to gaming metrics. The argument: process-based systems have (1) better differential capabilities for tasks without clear outcome metrics, (2) better alignment properties long-term, and (3) both architectures are "attractors" with lock-in effects, so pushing for process now matters.

The practical instantiation is Elicit, an AI research assistant. The "alignment windfall" framing (articulated by James Brady, Head of Eng): "For Elicit, creating a valuable product is the same thing as building a truthful, transparent system. Trustworthiness is our value proposition." If Elicit succeeds commercially using process-based architectures, it becomes an "existence proof" that others copy -- shifting the landscape toward more alignable approaches.

What They Do

2017-2020 (research phase): Human experiments on factored cognition. Participants given small context windows to solve tasks cooperatively. Key finding: error propagation is the critical challenge -- you need error correction mechanisms that are themselves composable from small pieces.

2021-2023 (product launch): Elicit launched as an AI research assistant for literature review. Users ask research questions, receive structured tables of papers with AI-extracted data (population, intervention, outcomes, limitations). Each answer linked to specific paper passages. 200K monthly users by September 2023.

Sept 2023-present (PBC era): Elicit spun off as a Public Benefit Corporation. Raised $9M seed (Fifty Years), then $22M Series A at $100M valuation (Spark Capital + Footwork, Feb 2025). 400K+ monthly active users. Estimated $18-22M ARR. Shipped 1.4 features per week on average. Major launches: Notebooks, Research Agents, Systematic Reviews, Reports.

Technical approach: Multi-model ensemble (fine-tuned Flan-T5, GPT-3.5/4, Claude). Hybrid semantic + lexical search with multi-stage re-ranking. Chain of thought for all extractions. Factored verification to detect hallucinations (published at WIESP 2023 -- reduced hallucinations by ~40-50% via self-correction). Constitutional AI for training data generation.

Research output: Publication trajectory shifted from theoretical (factored cognition experiments) to applied (factored verification, iterated decomposition) to product-oriented. Key papers: Iterated Decomposition (2023, improved science Q&A from 25% to 65% accuracy), Factored Verification (2023), Supervise Process not Outcomes (2022), RAFT benchmark (NeurIPS 2021).

Open source: ICE (Interactive Composition Explorer) for compositional LM programs. Factored Cognition Primer tutorial.

Key People

Andreas Stuhlmuller -- CEO and co-founder. PhD Cognitive Science MIT (Tenenbaum), postdoc Stanford (Goodman). 20+ publications in probabilistic programming and computational cognitive science. Has been thinking about automating reasoning since his teens. Compensation at Ought was $120-167K -- well below market.

Jungwon Byun -- COO and co-founder. BA Economics Yale. Former Head of Growth at Upstart (helped scale 5x). Brings operational and growth expertise. Unusually effective at communicating the technical vision.

Owain Evans -- Board member and CEO of Ought (as of FY2025). Published TruthfulQA, Emergent Misalignment (Nature 2026), Reversal Curse. Runs Truthful AI research group in Berkeley. World-class AI safety researcher whose continued involvement adds significant credibility.

Notable departure: William Saunders, early Ought software engineer, co-authored key papers, later joined OpenAI alignment team, then resigned Feb 2024 citing safety concerns. Said he asked himself if OpenAI was "more like the Apollo program or more like the Titanic."

Team: ~25-30 at Elicit. Mix of EA-adjacent (GiveWell), academic (PhD philosophy, PhD causality/safety), and strong tech (Stripe, Square, Figma, Google) backgrounds.

Money and Incentives

Ought (nonprofit) -- ~$9M total grants:

  • Open Philanthropy: $3.1M (2018-2020, "general support" under "Navigating Transformative AI")
  • FTX Future Fund: ~$5M (mid-2022, for "building Elicit" -- disbursed before FTX collapse)
  • Survival and Flourishing Fund: ~$800K
  • EA Funds: ~$60K

Elicit (PBC) -- $31M VC + growing product revenue:

  • $9M seed (Sept 2023, Fifty Years led)
  • $22M Series A (Feb 2025, Spark Capital + Footwork, $100M valuation)
  • Angel investors: Jeff Dean (Google), Tom Preston-Werner (GitHub), Thomas Ebeling (ex-Novartis CEO)
  • Estimated $18-22M ARR in 2025; $1M ARR within 4 months of launching paid tier

The transition: Ought sold its IP to Elicit PBC at an independent valuation (~$8.2M per FY2025 990). Ought now holds ~$10M in Elicit equity at carrying value. Zero employees, zero operations. The nonprofit is effectively a dormant equity holding company that could reactivate if Elicit returns profits.

Incentive alignment (claimed): Accuracy IS the value proposition. Building trustworthy AI IS building a good product. No lab funding, no compute dependency, no board overlap with frontier labs. This is the cleanest financial structure of any org claiming to do both commercial AI and safety work.

Incentive concerns (structural): As a VC-backed company, Elicit faces growth pressure. If end-to-end approaches become more accurate than process-based decomposition for certain tasks, will Elicit maintain its commitment to transparency, or will it adopt opaque methods to stay competitive? The PBC structure provides some protection, but ultimately, a PBC board can redefine its benefit obligations.

What Others Say

Vaniver (AI Alignment Forum): Factored cognition has a tree structure limitation -- horizontal communication between branches is impossible, leading to duplicated effort. Additionally, there is a safety-generality tradeoff: FC "allows you to stick to cognitive strategies that definitely solve a problem in a safe way" but may not allow developing new strategies without opening the door to inner optimizers.

Former Ought employee (Eli, EA Forum): Considers the theory of change "very conjunctive" -- process-based systems must be competitive with end-to-end training for AGI, which may be unlikely. Also flags uncertainty about "whether generally speeding up science is good or bad."

Andreas himself (acknowledging the "bitter lesson"): "The history of machine learning is the bitter lesson of outcomes winning. Vision and NLP started with more structured systems, which were replaced with end-to-end systems." He argues this pattern may not hold for high-stakes reasoning where transparency matters, but acknowledges this is "up in the air."

Jungwon (on RLHF calibration): "A lot of the reinforcement learning is causing the models to be incredibly uncalibrated and overconfident in their answers. And so that is preventing us from being able to use these models in particular ways." A candid acknowledgment of dependence on upstream model properties Elicit cannot control.

Footwork Ventures (investor, so discount accordingly): "Perhaps no company's story exemplifies [lifelong commitment] more so than Elicit's." Cites "400,000 monthly active users, tens of thousands of paying subscribers, and millions in annual revenue" growing through word-of-mouth.

What's Absent

  • No sustained external critique of the "alignment windfall" framework or Elicit's specific safety claims. The safety community has not engaged seriously with whether Elicit is safety-positive, safety-neutral, or safety-negative.
  • No 80,000 Hours podcast appearance -- unusual for an EA-funded org with this profile.
  • No mechanism to differentially accelerate safety research over capabilities research. Elicit is domain-agnostic.
  • No independent accuracy audit -- accuracy claims are self-reported.
  • IP sale terms remain opaque -- equity percentage, liquidation preferences, and acquisition scenarios for Ought's stake are undisclosed.
  • Paul Christiano's departure from the board (May 2024) has no public explanation, though a benign reading (Ought has minimal operations) is plausible.
  • Owen Cotton-Barratt remains on Ought's board despite his 2023 EA community controversy (resigned from all other EA positions due to sexual misconduct investigation).
  • No evidence that AI safety researchers are a significant user segment of Elicit, despite the safety claims.

Recommended Reading

  1. 2023 Cognitive Revolution podcast (most candid source) -- Andreas and Jungwon discuss the full arc from founding experiments to product. Honest about what didn't work. Nathan Labenz is a good interlocutor but is a disclosed investor. https://www.cognitiverevolution.ai/the-ai-reasoning-revolution-with-oughts-jungwon-byun-and-andreas-stuhlmuller/

  2. Ought's Theory of Change (EA Forum) -- Community discussion including a former employee's critique that the theory is "very conjunctive" and questioning whether speeding up science is good or bad. https://forum.effectivealtruism.org/posts/raFAKyw7ofSo9mRQ3/ought-s-theory-of-change

  3. "Supervise Process, not Outcomes" -- The core theoretical document. Clear, honest about limitations, acknowledges the bitter lesson. https://ought.org/updates/2022-04-06-process

  4. "Discovering Alignment Windfalls Reduces AI Risk" -- The strategic argument that Elicit is AI safety work. The most explicit articulation of the theory of change. https://elicit.com/blog/alignment-windfalls

  5. Vaniver's View on Factored Cognition -- Most substantive technical critique. https://www.alignmentforum.org/posts/J7Rnt8aJPH7MALkmq/vaniver-s-view-on-factored-cognition

Show Claude’s analysis
An opinionated read. Read the brief first to form your own view.

Stated Theory of Change

Ought/Elicit claims two mechanisms for reducing AI risk:

  1. Improving epistemics: By making high-quality, evidence-based reasoning cheaper and faster, Elicit helps researchers (including alignment researchers), policymakers, and decision-makers navigate the transition to advanced AI more wisely.

  2. Pioneering process supervision: By building a commercially successful product using process-based (decomposed, transparent) architectures, Elicit creates an "existence proof" that this approach works. Other companies copy it out of self-interest. The landscape of AI development shifts toward more alignable architectures.

The causal chain: Elicit demonstrates factored cognition works commercially -> Other companies adopt similar approaches -> The default AI architecture becomes more transparent and supervisable -> Advanced AI systems are built on process-based foundations -> Alignment is easier.

Revealed Theory of Change

What Ought/Elicit's actions suggest about what they actually believe and optimize for:

Actions consistent with stated theory:

  • Genuine commitment to process-based architecture (task decomposition, citations, transparent reasoning) even when it's harder to build.
  • Published Factored Verification paper that others can adopt.
  • Open-sourced ICE framework.
  • Andreas's "Will We Get Wise Enough Fast Enough?" talk shows genuine concern about whether epistemics can keep pace with capability.
  • Modest compensation during nonprofit years ($120-167K for the CEO) suggests real mission motivation.

Actions that complicate the stated theory:

  • The product is domain-agnostic -- it accelerates ALL research equally. There is no mechanism to differentially accelerate safety work.
  • Primary growth has been in biomedicine, engineering, and humanities -- not AI safety research.
  • The PBC transition was driven by practical necessity (FTX collapse threatened funding model, product needed VC to scale) as much as strategic vision.
  • Publication output has shifted from theoretical safety work to product-oriented research. The factored cognition research program was effectively abandoned in favor of building Elicit.
  • No evidence of efforts to attract AI safety researchers as a user segment.

The honest picture: Elicit is a genuine, well-built product that embodies process-based principles. The founders are sincere about the safety connection. But the causal chain from "Elicit succeeds" to "AI risk reduced" involves multiple untested assumptions, and the product development has naturally pulled the org away from direct safety work toward commercial growth.

Key Assumptions

Assumption 1: Process-based systems can remain competitive with end-to-end systems.

  • Evidence for: Elicit's product success with 400K+ users and $18-22M ARR suggests the approach works for current tasks.
  • Evidence against: The "bitter lesson" of ML is that end-to-end optimization eventually wins. Every previous structured approach in vision, NLP, and games was eventually superseded.
  • Testable: Yes, on a 2-5 year horizon. If frontier models can do systematic reviews end-to-end better than decomposed approaches, the assumption fails.
  • If wrong: Elicit either adapts (abandoning process-based approach) or loses market position. Either outcome undermines the safety theory of change.

Assumption 2: Commercial success of process-based systems shifts the broader AI landscape.

  • Evidence for: Labs have expressed interest in process supervision. OpenAI published a process-based supervision paper.
  • Evidence against: Labs' expressed interest has not translated into meaningful investment. End-to-end training remains overwhelmingly dominant. Elicit is tiny relative to the frontier lab ecosystem.
  • Testable: Partially. We can observe whether labs actually adopt process-based approaches at scale.
  • If wrong: Elicit is a nice research tool that has no impact on how AI systems are actually built.

Assumption 3: Faster, better reasoning tools are net positive for AI safety.

  • Evidence for: Better decision-making is plausibly good in expectation. Systematic reviews of AI governance options, risk assessments, etc. could be valuable.
  • Evidence against: Elicit accelerates ALL research, including capabilities research. If it makes it 20% faster to survey the literature on scaling laws, that helps capabilities researchers too. No differential mechanism exists.
  • Testable: In principle, but very difficult -- would require measuring downstream effects on safety vs. capabilities research.
  • If wrong: Elicit is a general productivity tool that modestly accelerates the AI race.

Assumption 4: The nonprofit-to-PBC transition preserved the safety mission.

  • Evidence for: PBC structure, stated mission, founder credibility, clean financial structure (no lab dependencies).
  • Evidence against: VC investors optimize for growth and returns. The PBC benefit obligation is not enforceable in the way nonprofit status is. Commercial pressure to add features, expand markets, and grow revenue will increasingly compete with safety-oriented architecture decisions.
  • Testable: Yes, by observing whether Elicit maintains its commitment to transparency as competitive pressure increases.
  • If wrong: Elicit becomes a standard AI SaaS product that no longer embodies process-based principles.

Strengths

  1. Intellectual coherence: The factored cognition -> process supervision -> Elicit product arc is genuinely coherent. Unlike many orgs that retrofit safety narratives onto existing work, Ought's product is a natural expression of its founding research vision.

  2. Clean financial structure: No lab funding, no compute dependency, no board overlap with frontier labs. This is rare among orgs claiming to do AI safety work while also building AI products.

  3. Founder credibility: Andreas has deep technical credentials (MIT PhD, Stanford postdoc, 20+ publications) and has been working on this problem for over a decade. The below-market compensation during nonprofit years is consistent with genuine mission motivation.

  4. Real product-market fit: 400K+ MAU, $18-22M ARR, $100M valuation, word-of-mouth growth. This is not vaporware -- the product genuinely works and users pay for it.

  5. Concrete safety-relevant contribution: Factored Verification is a practical technique for reducing hallucinations that others can and do adopt. This is more concrete than most safety org outputs.

  6. Strong board: Owain Evans (TruthfulQA, Emergent Misalignment) brings world-class AI safety credibility to the Ought board.

Weaknesses and Risks

  1. The "conjunctive" problem: The theory of change requires multiple independent assumptions to all hold simultaneously. Process-based systems must be competitive AND commercially successful AND adopted by others AND this must shift the AI development landscape AND the net effect must be safety-positive. Each conjunction reduces probability.

  2. No differential safety mechanism: Elicit accelerates all research equally. There is no reason to believe it disproportionately helps safety researchers. The most safety-positive version of Elicit would specifically prioritize safety applications, but this would conflict with commercial growth incentives.

  3. The bitter lesson looms: End-to-end ML systems have historically replaced every structured approach. If GPT-5 or equivalent can do systematic reviews end-to-end better than Elicit's decomposed approach, the core product thesis and the safety thesis collapse simultaneously.

  4. Scale mismatch: Elicit is a ~25-person company with $31M in VC funding. Frontier labs spend billions. The notion that Elicit's success will meaningfully shift how labs build AI systems is plausible but requires extraordinary influence relative to scale.

  5. Mission drift risk: The transition from nonprofit safety research to VC-backed SaaS product is a classic path toward mission drift. The PBC structure provides some protection but not enforcement. As revenue grows, the pressure to optimize for growth over safety-oriented architecture will increase.

  6. Owen Cotton-Barratt on the board: His 2023 EA controversy (resigned from all other EA positions) could become a reputational liability if Ought reactivates or the board becomes publicly visible.

Cross-References

  • Paul Christiano / ARC: Christiano's Iterated Amplification framework is the intellectual ancestor of factored cognition. He was on Ought's board until May 2024. ARC's research on scalable oversight is theoretically complementary to Elicit's practical approach.
  • Owain Evans / Truthful AI: Evans's research on truthfulness, emergent misalignment, and LLM behavior is directly relevant to Elicit's mission. He bridges the gap between Ought's board and frontier safety research.
  • William Saunders -> OpenAI: Saunders moved from Ought to OpenAI's alignment team, suggesting Ought's research environment prepared people for serious safety work. His subsequent resignation over safety concerns at OpenAI is independently informative.
  • Competitors: Consensus AI (consensus views across studies), Semantic Scholar (free, larger database), Scite (citation context). Elicit differentiates on depth of processing and transparent methodology. None of the competitors make safety claims.
  • Open Philanthropy: Funded Ought's early years under "Navigating Transformative AI." The fact that Open Phil classified Ought as AI safety work (not general science) is noteworthy.

What Would Change This Assessment

Upward updates:

  • Evidence that AI safety researchers are a significant and growing Elicit user segment.
  • A frontier lab explicitly adopts process-based supervision citing Elicit's influence.
  • Elicit introduces differential features for safety research (e.g., free/discounted access for alignment researchers).
  • Independent accuracy audit confirms Elicit's claims and shows clear advantage over end-to-end approaches.

Downward updates:

  • Elicit abandons process-based architecture for end-to-end approaches to stay competitive.
  • Evidence that Elicit primarily accelerates capabilities research.
  • VC investors push for growth that conflicts with safety-oriented design choices.
  • Frontier models make the decomposed approach obsolete for Elicit's core use cases.
  • The Ought nonprofit's equity stake is diluted or structured in ways that prevent it from funding future safety work.

Self-Critique

What's weakest in this analysis:

  • I have limited access to the actual forum discussions about Ought's theory of change (forum domains are restricted). The "conjunctive theory" critique and the "speeding up science" concern are from web search summaries, not full reading.
  • I may be over-crediting the alignment windfall argument because it's intellectually compelling even if empirically unvalidated.
  • The competitive landscape analysis is shallow -- I have not used Elicit, Consensus, or Semantic Scholar to directly compare.

What a thoughtful disagreer would say: "You're treating Elicit's safety claims too seriously. It's a research tool SaaS company. The safety framing is a founder narrative that helped raise philanthropic funding early on, and it persists because it differentiates from competitors and appeals to EA-adjacent investors. The actual safety impact is approximately zero -- no frontier lab is going to change its architecture because Elicit has 400K users."

My single weakest claim: That Elicit's success will "shift the landscape" toward process-based AI development. This is the most speculative link in the theory of change, and I found no evidence that it's happening.

What would most change my view: Evidence (positive or negative) about whether AI safety researchers actually use Elicit, and whether Elicit's process-based approach is actually being adopted by other companies building AI systems.

Connected to (5)

Sources (38)
Every URL that was read during research.
  1. 1.Oughtought.org
  2. 2.Elicit | Oughtought.org
  3. 3.Elicit Raises $9 Million and Becomes a Public Benefit Corporation - Elicitblog.elicit.com
  4. 4.Elicit is building a tool to automate scientific literature review | TechCrunchtechcrunch.com
  5. 5.Andreas Stuhlmüllerstuhlmueller.org
  6. 6.Elicitnbt.substack.com
  7. 7.Elicit Raises $22M to Build the Most Trusted AI Platform for Evidence-Backed Decisions - Elicitelicit.com
  8. 8.Factored Cognition | Oughtought.org
  9. 9.Supervise Process, not Outcomes | Oughtought.org
  10. 10.Team | Elicit: The AI Research Assistantelicit.com
  11. 11.Unbounded AI-Assisted Research with Elicit Founders Andreas Stuhlmüller and Jungwon Byuncognitiverevolution.ai
  12. 12.The AI Reasoning Revolution with Ought's Jungwon Byun and Andreas Stuhlmüllercognitiverevolution.ai
  13. 13.Elicit's $22M Series A - Deploying AI to radically increase good reasoning in the worldcreativerly.com
  14. 14.Discovering Alignment Windfalls Reduces AI Risk - Elicitelicit.com
  15. 15.AI Safety Needs Great Product Buildersjmsbrdy.com
  16. 16.Elicit and AI Safety - Elicitelicit.com
  17. 17.Elicit's Mission Is to Scale Up Good Reasoning - Elicitelicit.com
  18. 18.Will We Get Wise Enough Fast Enough? - Elicitelicit.com
  19. 19.Joining Oughtmaggieappleton.com
  20. 20.Owain Evansowainevans.github.io
  21. 21.Team | Oughtought.org
  22. 22.Factored Verification: Detecting and Reducing Hallucinations in Frontier Models Using AI Supervision - Elicitelicit.com
  23. 23.Factored Cognition Primer | Primerprimer.ought.org
  24. 24.Ep 23: Jungwon Byun, Andreas Stuhlmüller, and the future of research. - Futurati Podcastfuturatipodcast.com
  25. 25.Elicit's Core Bets: Be Systematic, Transparent, and Unbounded - Elicitelicit.com
  26. 26.Pricing | Elicit: The AI Research Assistantelicit.com
  27. 27.Elicit Revenue Model 2026 | How AI Research Platforms Make Moneymiracuves.com
  28. 28.Elicit Blog - Al for scientific researchelicit.com
  29. 29.New in Elicit: Research Agents - Elicitelicit.com
  30. 30.What is Brief History of Elicit Company?canvasbusinessmodel.com
  31. 31.Elicitpmc.ncbi.nlm.nih.gov
  32. 32.Careers | Elicit: The AI Research Assistantelicit.com
  33. 33.Ought Inc - Nonprofit Explorer - ProPublicaprojects.propublica.org
  34. 34.Ought has spun off Elicit | Oughtought.org
  35. 35.A Library and Tutorial for Factored Cognition with Language Models | Oughtought.org
  36. 36.Andreas Stuhlmüller: Training ML Systems to Answer Open-ended Questions | Effective Altruismeffectivealtruism.org
  37. 37.Elicit's limitationssupport.elicit.com
  38. 38.Introducing Elicit Systematic Review - Elicitelicit.com