Skip to main navigation Skip to search Skip to main content

Machine Bias. How Do Generative Language Models Answer Opinion Polls?

  • CERAPS
  • ENSAE

Research output: Contribution to journalArticlepeer-review

Abstract

Generative artificial intelligence (AI) is increasingly presented as a potential substitute for humans, including as research subjects. However, there is no scientific consensus on how closely these in silico clones can emulate survey respondents. While some defend the use of these “synthetic users,” others point toward social biases in the responses provided by large language models (LLMs). In this article, we demonstrate that these critics are right to be wary of using generative AI to emulate respondents, but probably not for the right reasons. Our results show (i) that to date, models cannot replace research subjects for opinion or attitudinal research; (ii) that they display a strong bias and a low variance on each topic; and (iii) that this bias randomly varies from one topic to the next. We label this pattern “machine bias,” a concept we define, and whose consequences for LLM-based research we further explore.

Original languageEnglish
Pages (from-to)1156-1196
Number of pages41
JournalSociological Methods and Research
Volume54
Issue number3 Special Issue: Integrating Generative AI into Social Scienc...
DOIs
Publication statusPublished - 1 Aug 2025

Keywords

  • LLMs
  • bias
  • computational social sciences
  • generative artificial intelligence
  • machine learning
  • survey research

Fingerprint

Dive into the research topics of 'Machine Bias. How Do Generative Language Models Answer Opinion Polls?'. Together they form a unique fingerprint.

Cite this