← AI Safety Orgs

Forecasting Research Institute

Forecasting

Tetlock. Superforecasting for AI risk.

Founded
2022
HQ
Claymont, DE (registered); remote/distributed
Team
40
Structure
501(c)(3) nonprofit
Model
Grants

Theory of Change

FRI aims to improve decision-making on catastrophic risks by advancing the science of forecasting. The causal chain: better forecasting methods produce more reliable probability estimates, which create shared baselines for policy discussions, which lead to better-calibrated responses to existential risk.

Cofounder Ezra Karger draws an analogy to the Federal Reserve's Survey of Professional Forecasters: "If we're going to continue to have discussions about existential risks, it seems useful to have forecasts that we in the future will track over time that tell us how people's beliefs about risks are changing." He frames FRI's value not as producing definitive numbers but as making implicit beliefs explicit: "Experts think this; accurate forecasters think this. They might both be wrong, but we can at least start from here."

CEO Josh Rosenberg identifies the specific gap: "There still is this big gap where it seems like [forecasting is] not being taken up as much by important decision makers as it could be." FRI pursues a two-pronged strategy from its 2022 founding announcement: foundational research (handling low-probability events, long-run outcomes, complex topics) and translational work (making forecasting decision-relevant, mapping disagreements, identifying useful contexts).

What They Do

Existential Risk Persuasion Tournament (XPT, 2022): 169 participants produced the most widely-cited dataset of x-risk probability estimates. Domain experts: 6% extinction by 2100, 20% catastrophic risk. Superforecasters: 1% extinction, 9% catastrophic. AI specifically: 3% extinction (experts) vs. 0.38% (superforecasters). The 750,000x question-wording effect was discovered here -- changing the response format from "percentage" to "1-in-X with examples" shifted median extinction estimates from 5% to 1-in-15-million.

AI Adversarial Collaboration (2023): 22 participants debated AI risk for 8 weeks (median 80 hours for skeptics). "Concerned" group: 20% AI extinction. "Skeptical" group: 0.12%. Despite extensive engagement, views converged negligibly. The best short-term crux (METR finding dangerous AI capabilities) would close only 5% of the disagreement gap. Key finding: both groups agreed powerful AI would be developed by 2100 (90% and 88% respectively), but disagreed on timing -- the "skeptics'" median date for AI displacing humans was 2450, the "concerned" group's was 2045.

LEAP (launched June 2025): 339 experts across CS, industry, economics, and policy, plus 60 superforecasters and 1,400 public participants. Monthly surveys over 3 years. Key findings: median expert predicts 18% of work hours assisted by generative AI by 2030; experts predict significantly less progress than frontier lab CEOs but more than the public. In Wave 4 (Dec 2025), the median expert predicted 14% AI accuracy on LiveCodeBenchPro Hard -- GPT-5.2 hit 33% shortly after.

ForecastBench (Sept 2024): Benchmark comparing LLM vs. human forecasting accuracy. Accepted at ICLR 2025. GPT-4.5 beats the public median but not superforecasters, with AI parity projected late 2026.

Nuclear and Biorisk Forecasting: Nuclear study (110 experts + 41 superforecasters) found 1-5% catastrophe risk by 2045 and identified six tractable policies. Biorisk study found AI milestones would increase epidemic risk from 0.3% to 1.5%, but mitigation measures could reduce it back to baseline. In both domains, experts underestimated AI capabilities timelines.

Policy engagement: Contributed capability forecasts to the International AI Safety Report 2026. UN First Committee side event on nuclear risk. GiveWell contract ($200K) for water chlorination forecasting -- FRI's first non-x-risk applied project.

Key People

Philip Tetlock -- President, Chief Scientist. PIK Professor at UPenn (Wharton + SAS). Author of Expert Political Judgment (2005) and Superforecasting (2015). Co-created the Good Judgment Project (IARPA tournament winner). Also co-founder/advisor of Good Judgment Inc. (commercial forecasting). FRI is essentially "Tetlock's academic research program given institutional form."

Josh Rosenberg -- CEO (since mid-2023, replacing Page Hedley after 7 months). Former Senior Advisor at GiveWell. Focused on making forecasting decision-relevant for policymakers.

Ezra Karger -- Director of Research, Cofounder. Senior Economist at the Federal Reserve Bank of Chicago. PhD Economics, U of Chicago. Himself a superforecaster from IARPA tournaments. Receives no compensation from FRI. Also an advisor to Open Philanthropy (FRI's primary funder).

Team is ~15-20 core staff plus ~25 RAs/consultants. Remote/distributed. Board of 5 includes Tetlock and Rosenberg plus three academics.

Money and Incentives

Total known funding: ~$18.5M. Nearly 100% from a single source:

Source Amount Date Range
Coefficient Giving / Open Phil $18,325,525 2022-2025 (9 grants)
GiveWell $200,808 2025

The two largest grants: $10.08M general support (Feb 2025, 3-year) and $6.3M Science of Forecasting (Dec 2022, 3-year). Project-specific grants for LEAP ($1.07M), tripwire evaluations ($359K), ForecastBench ($100K), and others.

Financial data (990): 2023 expenses were $1.44M with $360K top compensation and $4.15M in net assets. The organization is still spending down initial grants -- the $10.08M 2025 grant significantly extends the runway.

Business model: Purely philanthropic grants. No product revenue, no government contracts, no commercial clients (the GiveWell project is itself grant-funded).

Funder dependency is extreme. Open Phil/Coefficient Giving provides 99%+ of all funding. This is the highest funder concentration of any AI safety org examined. Open Phil's "Forecasting" focus area (~$50M across 30+ grants) treats FRI as the primary vehicle.

Structural conflict: Ezra Karger (FRI's research director, uncompensated) simultaneously advises Open Philanthropy (FRI's near-sole funder). This is an unusual degree of entanglement between researcher and funder.

What Others Say

The fundamental methodological critique (Narayanan & Kapoor, AI Snake Oil/Normaltech, July 2024): "AI x-risk forecasts are far too unreliable to be useful for policy, and in fact highly misleading." They argue there's no reference class for AI extinction, subjective probabilities "are nothing more than guesses," forecast skill is undetectable for tail risks (proving one forecaster wrong over another on 1% vs. 0.001% events would require billions of observations), and FRI's own 750,000x question-wording effect proves "speculation gets laundered through pseudo-quantification." They cite FRI's XPT as "the most elaborate and well-executed x-risk forecasting exercise" while arguing its numbers are fundamentally unreliable. Notably, Karger was acknowledged for feedback on the paper.

The LEAP methodology error (EA Forum, Nov 2025, ~86 karma): A community member identified that FRI's LEAP survey (1) framed even the "slow progress" scenario as describing near-AGI, and (2) presented vote-share predictions as probabilities. A follow-up post provided a detailed breakdown of the error. FRI revised the report in response, which critics acknowledged positively.

Systematic AI underestimation (Epoch AI, Jan 2026): Independent analysis found XPT superforecasters gave only 9.7% average probability to observed AI progress outcomes (domain experts: 24.6%). This pattern persists across XPT, LEAP, biorisk, and virology benchmark timelines.

The adversarial collaboration reaction (BetterWithoutAI): "The experiment was a roaring success, in the sense that it confirmed that both groups were almost perfectly immune to evidence or arguments against their beliefs." Called both groups' estimates "essentially meaningless."

Joe Carlsmith on superforecaster numbers: After superforecasters reviewed his power-seeking AI report (giving 1% extinction risk vs. his 5%), he reported not updating heavily because their "written arguments haven't moved me much" and he's "unsure how much to defer to raw superforecaster numbers" for long-term questions.

Good Judgment Inc. on ForecastBench: Argued the benchmark's design advantages AI through a frozen human baseline, data-heavy questions, and multiple AI attempts. "Bullish is not the word here" -- superforecasters who predicted early AI parity simultaneously argued the milestone wouldn't mean much.

What's Absent

No documented policy impact. After $18M+ and 3+ years, there is no published case of any government, AI lab, or major institution changing a decision based on FRI forecasts. Contributing to the IAISR 2026 is the closest evidence.

Epistemic Audits disappeared. Listed in the founding announcement as a key service line. No evidence of any being conducted. The research page now references "Epistemic Reviews" as in progress.

Zero revenue diversification. No SFF, EA Funds, government grants, or private donors identified beyond CG/OP and GiveWell.

No published conflict-of-interest policy despite multiple structural conflicts (Tetlock-GJI, Karger-OP, CG participant selection).

No self-assessment or track record evaluation. An org devoted to calibration and accuracy has never published a systematic review of its own forecasting track record.

Recommended Reading

  1. 80K Hours podcast with Ezra Karger (#200, Sept 2024) -- The most candid insider view. Karger is disarmingly honest about what FRI can and cannot do. 2h49m. Link

  2. "AI existential risk probabilities are too unreliable to inform policy" (Normaltech, July 2024) -- The strongest counterargument to FRI's entire approach. Uses FRI's own data against them. Link

  3. Results from an Adversarial Collaboration on AI Risk (March 2024) -- FRI's most honest output. Shows that structured debate doesn't resolve deep disagreements about AI risk. Link

  4. How well did forecasters predict 2025 AI progress? (Epoch AI, Jan 2026) -- Independent evidence that the forecaster population FRI relies on systematically underestimates AI progress. Link

  5. LEAP Waves 1-3 Insights Report (Nov 2025) -- The most ambitious current output: 339 experts forecasting AI trajectory with detailed rationales. Link

Show Claude’s analysis
An opinionated read. Read the brief first to form your own view.

Stated Theory of Change

FRI's stated theory is that better forecasting methodology, rigorously applied, will produce more reliable probability estimates of catastrophic risks. These estimates create shared baselines for policy discussions. Decision-makers who start from calibrated probabilities will make better choices about risk mitigation. Over time, tracking how beliefs change produces an "inflation expectations for x-risk" -- a public good that improves collective decision-making.

The specific mechanism is not forecasting-as-prediction but forecasting-as-structured-disagreement: FRI's most important contribution is mapping where experts and superforecasters agree and disagree, identifying the cruxes that drive those disagreements, and publishing this information so others can decide what to do with it.

Revealed Theory of Change

FRI's actions substantially align with their stated mission but reveal some telling divergences:

Alignment. FRI genuinely produces rigorous forecasting research. The XPT, adversarial collaboration, and LEAP are all methodologically serious, transparent about their limitations, and openly published. The 750,000x question-wording effect was discovered and published by FRI, not by critics -- that takes intellectual honesty. They revised their LEAP report when publicly challenged on methodology. The research team (especially Karger) is unusually candid about what forecasting can and cannot do.

Divergence 1: The translational gap is widening, not closing. FRI was founded to make forecasting "useful to decision-makers." Three years and $18M+ later, there's no documented case of a decision being changed by FRI forecasts. Epistemic Audits -- the planned translational product -- appear to have been abandoned or deprioritized. The research output is excellent, but the causal chain from "better forecasts" to "better decisions" remains undemonstrated.

Divergence 2: The methodology keeps revealing its own limitations. FRI's most important findings are about why forecasting doesn't work for x-risk: the 750,000x question-wording effect, the systematic underestimation of AI progress, the near-zero convergence in adversarial collaborations, and the lack of correlation between short-term accuracy and long-term risk views. Each study strengthens the case that x-risk probabilities are unreliable -- which is the opposite of what you'd want from a methodology shop trying to make those probabilities useful.

Divergence 3: FRI is an academic research program, not a policy translation engine. The team is overwhelmingly academic. The CEO comes from GiveWell (evidence-based philanthropy), not policy. The research director's day job is at the Federal Reserve Bank of Chicago. The publications look like academic papers, not policy briefs. The Strategy Fellow role (newly created) explicitly acknowledges the gap.

Key Assumptions

Assumption 1: Quantified probability estimates for existential risk are useful, even if imprecise.

  • Evidence for: The XPT created reference numbers that didn't exist before. Researchers (Matt Clancy) use them in formal models. The numbers provide a starting point for structured discussion. Even imprecise numbers reveal the structure of disagreement.
  • Evidence against: Normaltech/AI Snake Oil argues the numbers are "feelings dressed up as numbers" that "launder speculation through pseudo-quantification." The 750,000x sensitivity to question wording suggests the numbers aren't measuring a stable quantity. Lina Khan cited 15% as "optimistic" on AI extinction, demonstrating how non-specialists misuse the numbers.
  • Testable: Partially. Near-term components can be tested (and have been -- with unflattering results). Long-term components cannot be.
  • If wrong: FRI produces an expensive public good that looks like information but isn't. The numbers anchor discussions without informing them.

Assumption 2: Superforecasters add value over domain experts for unprecedented risks.

  • Evidence for: Superforecasters have demonstrated accuracy on geopolitical questions. They're more calibrated on base rates and less susceptible to domain-specific biases. Their agreement with experts on non-anthropogenic extinction (asteroid risk etc.) shows calibration where base rates exist.
  • Evidence against: On AI specifically, domain experts have been more accurate on near-term benchmarks (24.6% vs. 9.7% average probability on observed outcomes). Superforecasters' reference-class reasoning breaks down when there IS no reference class. Carlsmith reports their arguments weren't persuasive on object-level AI risk questions.
  • Testable: Yes, as more XPT/LEAP questions resolve. Early evidence favors domain experts for AI-specific questions.
  • If wrong: The core innovation of the superforecasting methodology -- that generalist forecasting skill transfers across domains -- fails in the most important domain.

Assumption 3: Mapping disagreement structure is intrinsically valuable, even if it doesn't produce convergence.

  • Evidence for: The adversarial collaboration identified concrete cruxes (METR evaluations, major powers war) and revealed that disagreement is about worldviews (rate of change, trust in theoretical arguments) rather than factual disputes. This is genuinely new information. Knowing that both groups agree powerful AI is coming (88-90%) but disagree on timing (2045 vs. 2450) reframes the debate productively.
  • Evidence against: If the mapped disagreements reflect "personality factors" and "non-epistemic" influences (as FRI's own report suggests), then the mapping is sociologically interesting but epistemically uninformative. You've documented a cultural divide, not an intellectual one.
  • Testable: Partially, through long-run resolution of cruxes.
  • If wrong: FRI becomes a very expensive sociology of knowledge project with limited policy relevance.

Assumption 4: FRI can maintain intellectual independence despite extreme funder concentration.

  • Evidence for: FRI publishes findings unflattering to the methodology their funder invested in. The 750,000x effect, the lack of convergence, and the systematic AI underestimation are all findings CG/OP might prefer didn't exist. Karger is candid in interviews.
  • Evidence against: 99%+ funder concentration creates structural dependency regardless of individual integrity. Karger advises OP while directing FRI research. CG referred "concerned" participants for the adversarial collaboration. No conflict-of-interest policy exists. The board has no independent members.
  • Testable: Would FRI publish a finding that directly threatened CG funding? We don't know because it hasn't happened.
  • If wrong: FRI's research is subtly shaped by what's fundable rather than what's most important.

Strengths

  1. Intellectual honesty is genuine and unusual. FRI consistently publishes findings that undermine their own methodology's credibility (750,000x effect, lack of convergence, systematic AI underestimation). This is the opposite of the institutional incentive. Most orgs hide their failures; FRI publishes them as research findings. Karger's candor in the 80K Hours interview is remarkable for an org leader.

  2. Created the reference dataset for x-risk discussions. Before the XPT, there was no systematic, comparable dataset of expert and superforecaster estimates of catastrophic risks. Now there is. Whatever one thinks of the numbers' reliability, they provide a common reference point. Researchers cite them, policymakers reference them, and the structured disagreement data is genuinely new.

  3. Tetlock's credibility provides unmatched institutional legitimacy. FRI benefits from decades of forecasting research credibility. This isn't purchased prestige -- it's earned through pioneering work (Expert Political Judgment, Superforecasting, IARPA tournaments). No other organization could have convened the XPT or LEAP with comparable participant quality.

  4. LEAP is the most comprehensive expert panel on AI forecasting ever assembled. 339 experts including top-cited AI scientists, frontier lab employees, leading economists, and policy researchers. With 600K+ words of rationales across first three waves. The longitudinal design (3 years of monthly surveys) will produce uniquely valuable data on how expert views evolve.

  5. Research quality is high. Publications in peer-reviewed venues (ICLR 2025 for ForecastBench), XPT book, multiple working papers. The adversarial collaboration methodology is genuinely innovative. The conditional trees method for question generation is novel.

Weaknesses and Risks

  1. The translational gap is FRI's existential challenge. Three years and $18M+ with no documented policy impact. The founding promise was to make forecasting useful to decision-makers. Epistemic Audits were abandoned. The Strategy Fellow role is a tacit admission of failure on this front. If LEAP and XPT data don't reach policymakers in actionable form, FRI is an expensive academic exercise.

  2. FRI's own findings undermine confidence in its core product. The 750,000x question-wording effect, systematic AI underestimation, and adversarial collaboration non-convergence all suggest that x-risk probability estimates are unreliable. FRI is honest about this, but it creates a paradox: the organization best positioned to produce x-risk forecasts has produced the strongest evidence that such forecasts can't be trusted.

  3. Extreme funder concentration (99%+ from one source) is a governance failure. Open Phil/CG could terminate FRI by declining to renew grants. There's no institutional resilience. The Karger-OP advisory relationship further blurs the boundary. The board has no independent members. No conflict-of-interest policy exists. For an organization receiving $18M+, this governance structure is inadequate.

  4. Superforecasters' track record on AI is bad and getting worse. XPT superforecasters substantially underestimated AI progress on every measured benchmark. Epoch AI found they gave only 9.7% probability to observed outcomes. If superforecasters are systematically biased on the most important variable (AI progress), the entire methodology is compromised for the most important application.

  5. The academic orientation limits impact. FRI's team, publications, and culture are academic. This produces rigorous research but limits engagement with the policymakers and industry leaders who could actually use it. The intelligence community's failure to adopt forecasting (which Karger himself cites) is a cautionary precedent.

Cross-References

Complementary to Metaculus: Metaculus is the platform (crowd predictions on resolvable questions). FRI is the methodology shop (how to elicit, aggregate, and interpret forecasts, especially for unprecedented events). They share Open Phil as primary funder and forecasting as core domain. FRI's Project Improbable (low-probability elicitation) could directly improve Metaculus's accuracy.

Complementary to Epoch AI: Epoch AI forecasts AI progress via compute trends (data-driven). FRI forecasts via expert elicitation (judgment-driven). These are fundamentally different approaches that should be compared: when compute trend extrapolation and expert judgment disagree, which is more reliable? Neither org has pursued this comparison. Epoch's finding that XPT forecasters underestimated AI progress validates Epoch's data-driven approach over FRI's judgment-driven one, at least for near-term capability questions.

Relationship to Good Judgment Inc.: Tetlock co-founded both. GJI provides superforecasters that FRI uses. FRI research validates the methodology GJI commercializes. GJI has pushed back on ForecastBench. No institutional firewall exists.

Funded by the same source as many AI safety orgs: CG/OP funds FRI, Metaculus, Epoch, METR, and many others. This creates correlated risk: a shift in CG priorities could affect the entire ecosystem simultaneously.

What Would Change This Assessment

  • Documented policy impact: If a government or major institution published evidence that FRI forecasts changed a consequential decision, that would significantly strengthen the theory of change.
  • Resolution of LEAP/XPT cruxes: As questions resolve by 2030, we'll learn whether superforecasters or domain experts were more accurate on AI progress. If superforecasters dramatically outperform, that vindicates the methodology. If domain experts outperform (early evidence suggests this), FRI's reliance on superforecasters needs rethinking.
  • Revenue diversification: If FRI secured government contracts, lab partnerships, or diversified philanthropic support, the governance concern would diminish.
  • Epistemic Audits becoming real: If FRI actually delivered organizational decision tools (not just research papers), the translational gap would close.
  • A forecasting failure with consequences: If a policy decision explicitly based on FRI-type forecasts led to a bad outcome, it would reveal whether the methodology does harm as well as whether it does good.

Self-Critique

What sources should I have checked but didn't?

  • The full 800-page XPT technical report. I read the summaries and the Karger interview but not the full document. Key methodological details may be buried there.
  • The later sections of both Tetlock 80K Hours interviews (Oct 2025). I read summaries but didn't read them in their entirety. These likely contain updated thinking on FRI's direction.
  • Titotal's detailed post "Explaining a subtle but important error in the LEAP survey" (EA Forum, Dec 2025) -- referenced but not fetched due to forum blocking.

Where is this analysis potentially biased?

  • I may be too sympathetic to FRI's intellectual honesty, treating "we published our own failures" as sufficient evidence of rigor when it might be a strategy to preempt criticism.
  • I may overweight the Normaltech critique because it's analytically elegant. There may be strong defenses of x-risk forecasting that I haven't engaged with.
  • I may underweight the diffuse value of creating reference forecasts. Like GDP statistics, the value may be in how thousands of people use the numbers, not in any single traceable decision.

What would a thoughtful person who disagrees say? "You're demanding documented policy impact that no methodology organization can show. Climate science took decades to influence policy. Tetlock's first generation of forecasting research (Expert Political Judgment, 2005) influenced the intelligence community over 15+ years. FRI is 3 years old. The XPT numbers are cited everywhere -- in academic papers, EA strategy documents, policy briefs, and the IAISR 2026. You can't trace these citations to specific decisions because that's not how information influences policy. It diffuses. FRI's job is to produce the best possible information and trust the system. The fact that they publish findings that undermine their own methodology shows they're doing real science, not advocacy -- and real science is exactly what this field needs."

What's my single weakest claim? The claim that FRI has no documented policy impact. "No documented" is doing a lot of work. It's entirely possible that IAISR 2026 authors, Open Phil grantmakers, and government officials in various countries regularly consult FRI data and it meaningfully shapes their decisions. The absence of a paper trail doesn't mean the absence of influence. This may be a documentation gap, not an impact gap.

What information would most change my view? A candid account from a government policymaker or institutional decision-maker saying: "We used FRI's XPT/LEAP data to calibrate our risk assessment, and it changed what we decided to do about X." One verified instance of this would substantially change my assessment of the theory of change from "broken at the decision link" to "working but undersold."

Connected to (9)

Epoch AIcollaborator
Federal Reserve Bank of Chicagocollaborator · Ezra Karger
SecureBiocollaborator
GiveWellstaff from · Josh Rosenberg
Good Judgment Inc.collaborator · Philip Tetlock
Open Nuclear Networkcollaborator
METRcollaborator
Open Philanthropyadvisor at · Ezra Karger
University of Pennsylvaniacollaborator · Philip Tetlock
Sources (63)
Every URL that was read during research.
  1. 1.Forecasting Research Instituteforecastingresearch.org
  2. 2.Team — Forecasting Research Instituteforecastingresearch.org
  3. 3.Research — Forecasting Research Instituteforecastingresearch.org
  4. 4.XPT — Forecasting Research Instituteforecastingresearch.org
  5. 5.AI Adversarial Collaboration — Forecasting Research Instituteforecastingresearch.org
  6. 6.Forecasting Biosecurity Risks from LLMs — Forecasting Research Instituteforecastingresearch.org
  7. 7.Assessing Near-Term Accuracy in the Existential Risk Persuasion Tournament — Forecasting Research Instituteforecastingresearch.org
  8. 8.Longitudinal Expert AI Panelleap.forecastingresearch.org
  9. 9.ForecastBenchforecastbench.org
  10. 10.New evidence that AI predictions are meaningless | Better without AIbetterwithout.ai
  11. 11.Ezra Karger on what superforecasters and experts think about existential risks | 80,000 Hours80000hours.org
  12. 12.What did forecasters get right and wrong in the largest existential risk forecasting tournament?forecastingresearch.substack.com
  13. 13.How well can large language models predict the future?forecastingresearch.substack.com
  14. 14.Josh Rosenberg on Forecasting Researchtellingthefuture.substack.com
  15. 15.Prof Tetlock on predicting catastrophe, keeping your politics secret & when experts know more than you80000hours.org
  16. 16.Accurately predicting the future is central to everything. Here's how you can do it better.80000hours.org
  17. 17.Introducing LEAP: The Longitudinal Expert AI Panelforecastingresearch.substack.com
  18. 18.A Professional Superforecaster Walks Us Through His AI Progress Forecastsforecastingresearch.substack.com
  19. 19.Professor Philip Tetlock's forecasting researchfounderspledge.com
  20. 20.ONN-FRI First Committee hybrid side event ''A Gamble of Our Own Choosing: Forecasting Strategies to Reduce Nuclear Risks''opennuclear.org
  21. 21.Forecasting Research Institute — Soliciting Expert Forecasts on the Mortality Effect of Water Chlorination (April 2025)givewell.org
  22. 22.Philip E. Tetlock - Wikipediaen.wikipedia.org
  23. 23.Nuclear Risk — Forecasting Research Instituteforecastingresearch.org
  24. 24.Publications — Forecasting Research Instituteforecastingresearch.org
  25. 25.FRI-ONN Project: Forecasting Nuclear Catastropheopennuclear.org
  26. 26.Will Humanity Achieve a Century of Nuclear Peace? New Study Forecasts Nuclear Risk | Newswisenewswise.com
  27. 27.Careers — Forecasting Research Instituteforecastingresearch.org
  28. 28.Strategy Fellow — Forecasting Research Instituteforecastingresearch.org
  29. 29.Senior Researcher — Forecasting Research Instituteforecastingresearch.org
  30. 30.Research Analyst and Senior Research Analyst — Forecasting Research Instituteforecastingresearch.org
  31. 31.Student Research Assistant — Forecasting Research Instituteforecastingresearch.org
  32. 32.Superforecasting the premises in "Is power-seeking AI an existential risk?" - Joe Carlsmithjoecarlsmith.com
  33. 33.AI existential risk probabilities are too unreliable to inform policynormaltech.ai
  34. 34.Philip E. Tetlock on Forecasting and Foraging as a Fox (Ep. 93)conversationswithtyler.com
  35. 35.A tournament tried to test how well experts could forecast AI progress. They were all wrong. | 67nj67nj.org
  36. 36.What Superforecasters Actually Said About ForecastBenchgoodjudgment.substack.com
  37. 37.What Experts and Superforecasters Think About the Future of AI Research and Developmentforecastingresearch.substack.com
  38. 38.Longitudinal Expert AI Panelleap.forecastingresearch.org
  39. 39.Why experts and forecasters disagree about AI risk | 80,000 Hours80000hours.org
  40. 40.A major flaw in the Forecasting Research Institute's "Longitudinal Expert AI Panel" surveyea.greaterwrong.com
  41. 41.Chief of Staff — Forecasting Research Instituteforecastingresearch.org
  42. 42.Forecasting Research Institutegithub.com
  43. 43.Forecasting Research Institute - Nonprofit Explorer - ProPublicaprojects.propublica.org
  44. 44.How well did forecasters predict 2025 AI progress?epoch.ai
  45. 45.Forecasting the End of the Worldtellingthefuture.substack.com
  46. 46.Making Forecasting Scores Easier to Interpret: Introducing the Brier Indexforecastingresearch.substack.com
  47. 47.Announcing the Forecasting Research Institute (we’re hiring)ea.greaterwrong.com
  48. 48.About - Forecasting Research Instituteforecastingresearch.substack.com
  49. 49.Forecasting biosecurity risks from large language models and the efficacy of safeguardsforecastingresearch.substack.com
  50. 50.How Well Did Superforecasters and Experts Predict Wet Lab Skill Uplift from LLMs?forecastingresearch.substack.com
  51. 51.Forecastingopenphilanthropy.org
  52. 52.Conditional Trees — Forecasting Research Instituteforecastingresearch.org
  53. 53.Results from an Adversarial Collaboration on AI Risk (FRI)ea.greaterwrong.com
  54. 54.A selection of cross-cutting results from the XPTea.greaterwrong.com
  55. 55.Announcing "Forecasting Existential Risks: Evidence from a Long-Run Forecasting Tournament"ea.greaterwrong.com
  56. 56.Philip Tetlock - The Decision Labthedecisionlab.com
  57. 57.Longitudinal Expert AI Panelleap.forecastingresearch.org
  58. 58.Longitudinal Expert AI Panelleap.forecastingresearch.org
  59. 59.Longitudinal Expert AI Panelleap.forecastingresearch.org
  60. 60.Charity Navigator - Rating for Forecasting Research Institutecharitynavigator.org
  61. 61.Forecasting Research Institute (FRI)causeiq.com
  62. 62.Forecasting Research Institute - Organizationidealist.org
  63. 63.Tracking the Evolution of AI: The Longitudinal Expert AI Panelblog.irvingwb.com