Unequal Scientific Recognition in the Age of LLMs

Publication

Findings of the Association for Computational Linguistics: EMNLP 2025

pages 23558–23568

November 9, 2025

Resources

Large language models (LLMs) are reshaping how scientific knowledge is accessed and represented. This study evaluates the extent to which popular and frontier LLMs, including GPT-4o, Claude 3.5 Sonnet, and Gemini 1.5 Pro, recognize scientists, benchmarking their outputs against OpenAlex and Wikipedia. Using a dataset focused on 100,000 physicists from OpenAlex, we uncover substantial disparities: LLMs exhibit selective and inconsistent recognition patterns. Recognition correlates strongly with measures of scholarly impact such as citations, and remains uneven across gender and geography. Women researchers and researchers from Africa, Asia, and Latin America are significantly underrecognized. We further examine the role of training data provenance, identifying Wikipedia as a potential source that contributes to recognition gaps. Our findings highlight how LLMs can reflect, and potentially amplify, existing disparities in science, underscoring the need for more transparent and inclusive knowledge systems.

NetSI authors

Yixuan Liu

Network Science PhD Student

Rodrigo Dorantes-Gilardi

Associate Research Scientist

Albert-László Barabási

University Distinguished Professor
