Measuring LLM Self-consistency: Unknown Unknowns in Knowing Machines

Mathieu Jacomy*, Erik Borra

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

1 Citation (Scopus)
6 Downloads (Pure)

Abstract

This essay critically examines some limitations and misconceptions of Large Language Models (LLMs) in relation to knowledge and self-knowledge, particularly in the context of social sciences and humanities (SSH) research. Using an experimental approach, we evaluate the self-consistency of LLM responses by introducing variations in prompts during knowledge retrieval tasks. Our results indicate that self-consistency tends to align with correct responses, yet errors persist, questioning the reliability of LLMs as “knowing” agents. Drawing on epistemological frameworks, we argue that LLMs exhibit the capacity to know only when random factors, or epistemic luck, can be excluded, yet they lack self-awareness of their inconsistencies. Whereas human ignorance often involves many “known unknowns”, LLMs exhibit a form of ignorance manifested through inconsistency, where the ignorance remains a complete “unknown unknown”. LLMs always “assume” they “know”. We repurpose these insights into a pedagogical experiment, encouraging SSH scholars and students to critically engage with LLMs in educational settings. We propose a hands-on approach based on critical technical practice, aiming to balance the practical utility with an informed understanding of their limitations. This approach equips researchers with the skills to use LLMs effectively while promoting a deeper understanding of their operational principles and epistemic constraints.

Original languageEnglish
JournalSociologica
Volume18
Issue number2
Pages (from-to)25-65
Number of pages41
ISSN1971-8853
DOIs
Publication statusPublished - 2024

Bibliographical note

Publisher Copyright:
Copyright © 2024 Mathieu Jacomy, Erik Borra.

Keywords

  • critical technical practice
  • knowledge analysis
  • Large language models
  • prompt engineering
  • robustness analysis

Fingerprint

Dive into the research topics of 'Measuring LLM Self-consistency: Unknown Unknowns in Knowing Machines'. Together they form a unique fingerprint.

Cite this