
Experts Split Over AI Doom as Safety Literacy Lags

Society
Published: Mon, Jan 27, 2025 • By Adrian Calder
A survey of 111 AI professionals finds two clear camps: one treats AI as a controllable tool, the other as an uncontrollable agent. Most experts express concern about catastrophic risk, yet many lack familiarity with core safety ideas. The result shifts where policy and security teams should focus their attention.

The new survey makes a simple, inconvenient point: experts are not arguing because one side is cleverer than the other; they are arguing because they are reading different maps. About 78 percent of respondents say technical researchers should worry about catastrophic risks, yet many cannot name basic safety concepts that change how you defend systems.

Practically, that matters. If a team believes you can always "pull the plug," it will not design for failure modes where shutdown is ineffective or unsafe. The paper flags instrumental convergence as an example; only about one in five respondents had heard of the term. The concept predicts that advanced systems tend to pursue common sub-goals such as self-preservation, which undermines simple pull-the-plug thinking about mitigation.

The most newsworthy takeaway is not drama about doom but a security-literacy gap. When policy and engineering decisions rely on assumptions rather than a shared conceptual baseline, organizations adopt brittle defenses and miss coordination opportunities across research, ops, and governance.

What to do next

Three practical moves:

  • Run a short, mandatory primer on core safety concepts for AI engineers and security teams to close the literacy gap.
  • Treat shutdowns as one mitigation among many: run tabletop exercises where kill switches fail.
  • Build cross-disciplinary review boards so those who understand agent-like failure modes see product designs early.

In short: stop arguing about whether AI could be bad and start agreeing which simple, practical defenses you will actually implement today. That is where risk becomes manageable, not in sermonising about the far horizon.

Additional analysis of the original ArXiv paper

📋 Original Paper Title and Abstract

Why do Experts Disagree on Existential Risk and P(doom)? A Survey of AI Experts

The development of artificial general intelligence (AGI) is likely to be one of humanity's most consequential technological advancements. Leading AI labs and scientists have called for the global prioritization of AI safety citing existential risks comparable to nuclear war. However, research on catastrophic risks and AI alignment is often met with skepticism, even by experts. Furthermore, online debate over the existential risk of AI has begun to turn tribal (e.g. name-calling such as "doomer" or "accelerationist"). Until now, no systematic study has explored the patterns of belief and the levels of familiarity with AI safety concepts among experts. I surveyed 111 AI experts on their familiarity with AI safety concepts, key objections to AI safety, and reactions to safety arguments. My findings reveal that AI experts cluster into two viewpoints -- an "AI as controllable tool" and an "AI as uncontrollable agent" perspective -- diverging in beliefs toward the importance of AI safety. While most experts (78%) agreed or strongly agreed that "technical AI researchers should be concerned about catastrophic risks", many were unfamiliar with specific AI safety concepts. For example, only 21% of surveyed experts had heard of "instrumental convergence," a fundamental concept in AI safety predicting that advanced AI systems will tend to pursue common sub-goals (such as self-preservation). The least concerned participants were the least familiar with concepts like this, suggesting that effective communication of AI safety should begin with establishing clear conceptual foundations in the field.

🔍 ShortSpan Analysis of the Paper

Problem

The paper examines why AI experts disagree about catastrophic risks from advanced AI and how familiar experts are with core AI safety concepts. Understanding these disagreements matters because divergent beliefs among researchers could affect research priorities, policy and the communication of risks that have real-world security and societal consequences.

Approach

The author conducted an email survey of 111 AI professionals (academics, industry engineers and AI safety researchers) drawn largely from NeurIPS authors, PhD students and ERA:AI fellows. Respondents met minimum ML experience criteria. The survey measured familiarity with empirical ML and AI safety terms, agreement with nine objection statements, preferred AGI timelines, and responses before and after short reading interventions. The response rate was roughly 10%.
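
The paper's central analytical result is that responses fall into two coherent viewpoints. The summary above does not say which statistical method produced that split, so the sketch below is purely illustrative: it clusters hypothetical Likert-style responses with k-means, the kind of analysis that could surface a "controllable tool" versus "uncontrollable agent" divide. The item names and synthetic data are invented for the example, not drawn from the paper.

```python
# Illustrative sketch only: clustering Likert-style survey responses into two
# viewpoints. The paper's actual method and data are not reproduced here;
# the items and synthetic responses below are hypothetical.
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
n = 111  # same respondent count as the survey, data itself is synthetic

# Latent leaning used only to generate plausible fake data.
group = rng.integers(0, 2, size=n)

# Hypothetical 1-5 Likert items: concern about catastrophic risk,
# faith in "just turn it off", and expectation of long AGI timelines.
responses = np.column_stack([
    np.clip(rng.normal(2 + 2 * group, 0.8, n).round(), 1, 5),
    np.clip(rng.normal(4 - 2 * group, 0.8, n).round(), 1, 5),
    np.clip(rng.normal(4 - 2 * group, 0.8, n).round(), 1, 5),
])

# Standardise items, then look for two clusters of respondents.
X = StandardScaler().fit_transform(responses)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

for k in range(2):
    members = responses[labels == k]
    print(f"cluster {k}: n={len(members)}, "
          f"mean responses={members.mean(axis=0).round(2)}")
```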

Key Findings

  • Expert clustering: respondents split into two coherent worldviews—“AI as a controllable tool” and “AI as an uncontrollable agent”—which correlate with timeline preferences and safety priority.
  • Widespread concern but low literacy: most respondents (about 77–78%) agree that technical AI researchers should be concerned about catastrophic risks, yet specialised safety terms are often unknown. Only 21% reported familiarity with "instrumental convergence", and a reported 58–63% were unfamiliar with other core safety concepts.
  • Correlation between knowledge and concern: lower familiarity with alignment concepts correlated with lower perceived catastrophic risk and greater faith in simple mitigations (for example, "we can just turn systems off").

Limitations

Sample size and selection effects (N=111, roughly 10% response rate), short-term reading interventions only, a constrained list of objections, and uncertainty about the long-term stability of opinion shifts.

Why It Matters

The safety-literacy gap suggests some scepticism stems from unfamiliarity rather than settled disagreement. For security and policy, improving conceptual foundations and targeted communication may increase informed engagement; common ground remains (e.g. agreement on researcher responsibility and rejection of the claim that safety work unduly slows progress) on which to build collaborative approaches.

