Linguistically Grounded Analysis of Language Models using Shapley Head Values

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Abstract

Understanding how linguistic knowledge is encoded in language models is crucial for improving their generalisation capabilities. In this paper, we investigate the processing of morphosyntactic phenomena, by leveraging a recently proposed method for probing language models via Shapley Head Values (SHVs). Using the English language BLiMP dataset, we test our approach on two widely used models, BERT and RoBERTa, and compare how linguistic constructions such as anaphor agreement and filler-gap dependencies are handled. Through quantitative pruning and qualitative clustering analysis, we demonstrate that attention heads responsible for processing related linguistic phenomena cluster together. Our results show that SHV-based attributions reveal distinct patterns across both models, providing insights into how language models organize and process linguistic information. These findings support the hypothesis that language models learn subnetworks corresponding to linguistic theory, with potential implications for cross-linguistic model analysis and interpretability in Natural Language Processing (NLP).
Original languageEnglish
Title of host publicationFindings of the Association for Computational Linguistics: NAACL 2025
PublisherAssociation for Computational Linguistics
Publication date29 Apr 2025
DOIs
Publication statusAccepted/In press - 23 Jan 2025
EventThe 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics - Albuquerque, United States
Duration: 29 Apr 20254 May 2025
https://2025.naacl.org/

Conference

ConferenceThe 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics
Country/TerritoryUnited States
City Albuquerque
Period29/04/202504/05/2025
Internet address

Fingerprint

Dive into the research topics of 'Linguistically Grounded Analysis of Language Models using Shapley Head Values'. Together they form a unique fingerprint.

Cite this