Welcome to Science Reader!

What we do here

Saturday , 4 July 2026

Welcome to Science Reader!

What we do here

Saturday , 4 July 2026

HomeThe New IntelligenceAI In Science Connects the Dots, But Only In Fields That Are Fragmented

AI In Science Connects the Dots, But Only In Fields That Are Fragmented

An analysis of 80 million papers shows AI boosts originality where knowledge is scattered and connections are weak, but contributes little novelty in structured science.

duodeskJuly 3, 202642

AI and computer science

The New Intelligence · Explore this series ›

Key Takeaways

AI lifts novelty in rough fields like astronomy, reverses to automation in codified ones like cell biology.
AI's citation premium reflects social rewards, not discovery, independent of knowledge structure.
AI papers are 30–40% more likely than field-mates to introduce conceptual or linguistic novelty.

On 9 October 2024 the Royal Swedish Academy of Sciences gave the Nobel Prize in Chemistry to three men, two of whom had built a machine. Demis Hassabis and John Jumper of Google DeepMind had taught a neural network, AlphaFold, to predict the folded shapes of proteins. Marking the field's grandest honour, DeepMind distilled the optimist's creed into a single sentence: AI "will make science faster and ultimately help to understand disease and develop therapeutics."

DeepMind's policy team had already gone further, announcing in a manifesto "a new golden age of discovery." A machine had, it seemed, originated something, and the establishment had blessed it.

Not everyone was nodding. Seven months earlier, Lisa Messeri of Yale and M. J. Crockett of Princeton had set down in Nature a sentence that has hung over the field since: the proliferation of AI tools "risks introducing a phase of scientific enquiry in which we produce more but understand less." Their deeper worry was monoculture: "scientific monocultures," they wrote, "in which some types of methods, questions and viewpoints come to dominate," the peripheral vision of a whole field quietly narrowing.

The proliferation of AI tools in science risks introducing a phase of scientific enquiry in which we produce more but understand less.
Lisa Messeri & M. J. Crockett, Yale & Princeton

For two years the argument between the two camps ran almost entirely on anecdote. Each side pointed at the same few triumphs and the same few terrors. Almost nobody had measured what AI does to scientific creativity across the whole of science.

Then, in April 2026, someone counted. Stefano Bianchini, an economist at BETA in Strasbourg, with Valentina Di Girolamo, Julien Ravet and David Arranz of the European Commission's research directorate, took more than eighty million papers from the OpenAlex database, spanning 2005 to 2023 and 172 fields. They asked a narrower, answerable question than "is AI good for science." When a paper uses AI, is it more novel (does it coin new words, new phrases, new combinations of ideas) and more impactful (does it get cited)?

Knowledge-space roughness

A measure of how rugged a field's terrain of ideas is - fragmented into disconnected clusters and hard to traverse, versus a smooth, well-connected plain. The paper finds AI lifts novelty most where the terrain is roughest.

Key figure

30–40%

How much more likely AI papers are to introduce linguistic or conceptual novelty than their field-mates

Their headline answer was probably what the AI optimists wanted to hear. AI papers are roughly 30 to 40 percent more likely than their field-mates to introduce linguistic or conceptual novelty, and they crowd the ranks of the most-cited. "The use of AI," the authors write, "is associated with more novel and highly cited research." Then the sentence turns, in the dry register of a results section: "However, these benefits are far from uniform."

Whether AI lifts novelty, it turns out, depends almost entirely on where it is used. Picture a field's knowledge as a landscape. Where the terrain is rugged, fragmented into disconnected peaks, combinatorially complex and hard to traverse, AI behaves like a pathfinder and lifts novelty sharply. Where the terrain is a smooth plain, well-trodden and well-connected, AI merely lets researchers walk the familiar paths faster, and novelty barely moves at all.

The effect strengthens with two things at once: a field's roughness, and its "AI exposure," how deeply the tools have penetrated it. Astronomy, bioinformatics, climatology, geodesy, medical physics and radiology sit at the strong end. In fields with standardised, codified protocols, cell biology, molecular biology, traditional medicine, the association runs the other way: AI shows its strongest negative novelty, working as technical automation, classifying and predicting without ever troubling the conceptual frontier.

The paper separates two things the debate routinely fuses: novelty and impact. Novelty, the saying of new things, tracks the structure of knowledge exactly as you would expect. Citations are a different story altogether: the terrain that amplifies novelty leaves the citation premium untouched, knowledge structure explains barely any of it, and the average effect on citation counts is, in the authors' own word, "modest."

The authors are careful about what they will and will not say: impact "may rather be shaped by social and institutional mechanisms (e.g., visibility, collaboration networks, reputational effects), whose dynamics operate independently of the cognitive structures captured by knowledge-space roughness."

My take on this is that the citations AI brings may be something older at work, the social machinery of science rewarding whoever picks up the shiny new instrument first. What the paper claims is narrower than it sounds: the reward and the discovery have come apart. It does not claim that AI has stopped aiding discovery.

...the reward and the discovery have come apart.

AI does not only impact analysis and research. In a 2026 Nature paper, James Evans of Chicago, Fengli Xu of Tsinghua and their co-authors tracked what happens to the scientists themselves. Those who adopt AI, who differ from their peers in much else besides, publish 3.02 times more, are cited 4.84 times more, and become project leaders 1.37 years earlier than those who do not. The same adoption shrinks the collective volume of topics science studies by 4.63 percent and cuts scientist-to-scientist engagement by 22 percent, because AI work "moves collectively toward areas richest in data."

Their verdict echoes Messeri and Crockett: "AI tools appear to automate established fields rather than explore new ones, highlighting a tension between personal advancement and collective scientific progress."

AI tools appear to automate established fields rather than explore new ones, highlighting a tension between personal advancement and collective scientific progress.
James Evans & Fengli Xu, Chicago & Tsinghua

Jian Gao and Dashun Wang of Northwestern, measuring the same citation premium in 2024, added a further discomfort: its benefits skew away from fields with more women and Black scientists. It echoes the tension Evans had named, between personal advancement and collective progress.

The optimists are not left empty-handed, and honesty requires saying so. In a blind study run by Chenglei Si, Diyi Yang and Tatsunori Hashimoto at Stanford in 2024, some seventy-nine expert reviewers judged research ideas generated by a large language model to be more novel than those of human experts (p < 0.05). The same machine ideas scored slightly weaker on feasibility, a caveat the authors themselves press.

And AlphaFold, the prize-winner, turns out on closer inspection to have redirected science rather than accelerated it. Ryan Hill and Carolyn Stein, in a 2026 working paper (not yet peer-reviewed, so hold it lightly), find that after AlphaFold2's 2021 release the rate of experimental structure determination barely changed. Basic research on previously unstructured proteins rose by 15 to 40 percent, with no measurable shift yet into early-stage drug development. AlphaFold simply pointed scientists toward the proteins they had long ignored.

There is a nice, ironic twist in our tale. Four years earlier, Bianchini himself, with Müller and Pelletier, had argued that AI was diffusing as a "general method of invention" that tracked well-defined research trajectories and was associated with less novelty. The 2026 paper reverses that picture, conditionally; its lead author was honest enough to revise himself in public.

The reversal is not really a contradiction. The data set grew, the transformer and the large language model arrived in the interval, and the measurement followed them. The field's verdict on AI is young enough that even its own measurers are still revising it.

The sharpest cautionary tale arrived in late 2024, when a graduate student named Aidan Toner-Rodgers circulated a working paper claiming that an AI tool, deployed at a 1,018-scientist materials lab, had lifted discovery by 44 percent, patents by 39 percent, and product innovation by 17 percent. It was the single cleanest empirical proof anyone had produced that AI supercharges discovery, and it was embraced by Daron Acemoglu and David Autor, two of the most decorated economists alive. Then it dissolved.

In May 2025 MIT announced it had "no confidence in the provenance, reliability or validity of the data", the paper was withdrawn from arXiv, and its author left the institution. "There is no world where this makes any sense," Autor told the Wall Street Journal. MIT itself noted that "even in its non-published form, the paper is having an impact on discussions and projections about the effects of AI on science." The 44 percent did not survive scrutiny. What lingers is how badly the discipline's most distinguished readers seem to have wanted it to.

If AI's value depends on terrain and on depth of adoption, the binding constraint may lie in diffusion: getting AI well into the hard, fragmented fields where it earns its novelty and where adoption still lags. That, as it happens, is the bet Europe has begun to place.

Sources

Primary source: Bianchini, Stefano, Valentina Di Girolamo, Julien Ravet & David Arranz. "AI in science: When and where it makes a difference." Research Policy 55 (2026) 105478.
Context sources:
- Hao, Xu, Li & Evans. "AI tools expand scientists' impact but contract science's reach." Nature (2026); preprint arXiv:2412.07727.
- Messeri, Lisa & M. J. Crockett. "Artificial intelligence and illusions of understanding in scientific research." Nature 627 (2024).
- Gao, Jian & Dashun Wang. "Quantifying the use and potential benefits of artificial intelligence in scientific research." Nature Human Behaviour 8 (2024).
- Si, Chenglei, Diyi Yang & Tatsunori Hashimoto. "Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study." arXiv:2409.04109 (2024).
- Hill, Ryan & Carolyn Stein. "How Artificial Intelligence Shapes Science: Evidence from AlphaFold." Working paper (2026).
- Bianchini, Stefano, Moritz Müller & Pierre Pelletier. "Artificial intelligence in science: An emerging general method of invention." Research Policy 51(10) (2022).
- Google DeepMind. "A new golden age of discovery" (policy essay, 2024) and "Demis Hassabis and John Jumper awarded Nobel Prize in Chemistry."
- "The Nobel Prize in Chemistry 2024." Royal Swedish Academy of Sciences, October 9, 2024.
- Toner-Rodgers, Aidan. "Artificial Intelligence, Scientific Discovery, and Product Innovation." arXiv:2412.17866 (2024, withdrawn); MIT Department of Economics, "Assuring an accurate research record" (May 16, 2025).
- European Commission. "European Strategy for AI in Science" (October 8, 2025) and RAISE pilot launch (Copenhagen, November 2025).

Fact Check: Claim-by-Claim Verification Verified

A two-round Claude + Perplexity dialogue verified all seventeen major claims. Every empirical figure (the Bianchini 80-million-paper study, the Evans/Xu 3.02x/4.84x/1.37-year effects, the Hill/Stein AlphaFold results, the Toner-Rodgers 44/39/17 percent figures) checks out against primary sources. One attribution error was found and fixed: a verbatim DeepMind statement had been misattributed to John Jumper's acceptance of the Nobel Prize.

The 2024 Nobel Prize in Chemistry (announced 9 October 2024) went to three laureates, two of whom (Hassabis, Jumper of DeepMind) built AlphaFold.

The Chemistry prize was announced 9 October 2024 (Physics was 8 October), awarded half to David Baker and half jointly to Demis Hassabis and John Jumper for AlphaFold protein-structure prediction. Nobel Prize press release.

AI "will make science faster and ultimately help to understand disease and develop therapeutics."

The sentence is verbatim from DeepMind's Nobel announcement ("It is a key demonstration that AI will make science faster and ultimately help to understand disease and develop therapeutics"), which is an institutional statement, not words Jumper spoke while accepting the prize. DeepMind Nobel announcement.

DeepMind's policy team announced "a new golden age of discovery."

"A New Golden Age of Discovery" is the title of Google DeepMind's policy essay on AI for science. AI Policy Perspectives.

Messeri (Yale) and Crockett (Princeton) wrote in Nature that AI risks a phase where "we produce more but understand less," warning of "scientific monocultures."

Both quotations appear verbatim in their 2024 Nature commentary. Messeri & Crockett, Nature (2024).

Bianchini et al. (2026, Research Policy) analysed more than 80 million OpenAlex papers, 2005-2023, across 172 fields.

The paper states a dataset of more than 80 million papers published 2005-2023 from the OpenAlex snapshot. Bianchini et al., Research Policy (2026).

AI papers are roughly 30-40% more likely than field-mates to introduce linguistic or conceptual novelty.

The paper reports AI adoption is associated with more novel and highly cited research, with the novelty-probability uplift in this range; exact figures vary by specification. Research Policy (2026).

The novelty benefit is strong in rugged/fragmented fields (astronomy, bioinformatics, climatology, geodesy, medical physics, radiology) and turns negative in codified fields (cell biology, molecular biology, traditional medicine).

The paper finds AI's novelty effect is moderated by knowledge-space roughness, strongest in fragmented fields and negative in fields with standardised protocols. Research Policy (2026).

The citation premium is unrelated to knowledge structure and "modest" on average; impact may reflect social/institutional mechanisms.

The paper reports the average citation effect is modest and largely decoupled from knowledge structure, attributing impact to visibility, collaboration networks and reputation. Research Policy (2026).

Evans (Chicago) and Xu (Tsinghua), Nature 2026: AI adopters publish 3.02x more, are cited 4.84x more, lead projects 1.37 years earlier; adoption shrinks topics studied by 4.63% and cuts scientist engagement by 22%.

All five figures and the "automate established fields rather than explore new ones" verdict are confirmed against the paper "AI tools expand scientists' impact but contract science's reach." Nature (2026).

Gao and Wang (Northwestern, 2024) found AI's benefits skew away from fields with more women and Black scientists.

The paper's abstract explicitly states disciplines with higher proportions of women or Black scientists reap fewer benefits from AI, risking wider inequality. Gao & Wang, Nature Human Behaviour (2024).

Stanford blind study (Si, Yang, Hashimoto, 2024): 79 expert reviewers judged LLM-generated ideas more novel than human experts' (p < 0.05), but slightly weaker on feasibility.

The study used 49 idea-writers and 79 reviewers; LLM ideas were rated more novel (p < 0.05) and marginally less feasible. Si et al., arXiv (2024).

Hill and Stein (2026 working paper): after AlphaFold2 (2021) the rate of experimental structure determination barely changed, while basic research on previously unstructured proteins rose 15-40%, with no measurable shift into early-stage drug development.

The NBER working paper reports the experimental-determination rate almost unchanged and a 15-40% rise in basic research on previously structureless proteins, with no applied drug-development shift yet. Hill & Stein working paper.

An earlier Bianchini, Muller and Pelletier paper argued AI was diffusing as a "general method of invention" associated with less novelty.

Their 2022 Research Policy paper framed AI as an emerging general method of invention tracking well-defined trajectories, associated with lower novelty. Bianchini, Muller & Pelletier, Research Policy (2022).

Toner-Rodgers's late-2024 working paper claimed an AI tool at a 1,018-scientist materials lab lifted discovery 44%, patents 39%, and product innovation 17%; endorsed by Acemoglu and Autor.

The paper's abstract stated 44% more materials discovered, a 39% rise in patent filings and a 17% rise in downstream product innovation among 1,018 scientists. Toner-Rodgers working paper.

In May 2025 MIT stated it had "no confidence in the provenance, reliability or validity of the data"; the paper was withdrawn and the author left; Autor said "There is no world where this makes any sense."

MIT's statement, the arXiv withdrawal and Autor's quote are documented. MIT Economics statement; WSJ via To Vima.

The European Commission launched a Strategy for AI in Science on 8 October 2025 and, about a month later, a RAISE pilot in Copenhagen.

The Strategy is dated 8 October 2025; the RAISE pilot was launched at the AI in Science Summit in Copenhagen on 3-4 November 2025. EC AI in Science Strategy; EC press release.

Ada Lovelace (1843) wrote the Analytical Engine "has no pretensions whatever to originate anything"; Turing (1950) countered that machines "can take us by surprise."

Lovelace's line is from her 1843 Notes on the Analytical Engine; Turing's "Machines take me by surprise with great frequency" is from Computing Machinery and Intelligence (1950), fairly paraphrased. Lovelace, Note G; Turing, Mind (1950).

Commentary

The Bianchini (2026) and Hill/Stein (2026) papers are recent; Hill/Stein remains a working paper not yet peer-reviewed, which the article correctly flags.
The 30-40% novelty figure is an approximate range; the paper reports different point estimates by specification.
The Toner-Rodgers figures are reported strictly as the withdrawn paper's original claims, which did not survive MIT's data-integrity review; the article frames them correctly as discredited.