Linguistically Grounded Analysis of Language Models using Shapley Head Values

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Abstract

Understanding how linguistic knowledge is encoded in language models is crucial for improving their generalisation capabilities. In this paper, we investigate the processing of morphosyntactic phenomena, by leveraging a recently proposed method for probing language models via Shapley Head Values (SHVs). Using the English language BLiMP dataset, we test our approach on two widely used models, BERT and RoBERTa, and compare how linguistic constructions such as anaphor agreement and filler-gap dependencies are handled. Through quantitative pruning and qualitative clustering analysis, we demonstrate that attention heads responsible for processing related linguistic phenomena cluster together. Our results show that SHV-based attributions reveal distinct patterns across both models, providing insights into how language models organize and process linguistic information. These findings support the hypothesis that language models learn subnetworks corresponding to linguistic theory, with potential implications for cross-linguistic model analysis and interpretability in Natural Language Processing (NLP).
OriginalsprogEngelsk
TitelFindings of the Association for Computational Linguistics: NAACL 2025
ForlagAssociation for Computational Linguistics
Publikationsdato29 apr. 2025
Sider850-865
DOI
StatusUdgivet - 29 apr. 2025
BegivenhedThe 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics - Albuquerque, USA
Varighed: 29 apr. 20254 maj 2025
https://2025.naacl.org/

Konference

KonferenceThe 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics
Land/OmrådeUSA
By Albuquerque
Periode29/04/202504/05/2025
Internetadresse

Fingeraftryk

Dyk ned i forskningsemnerne om 'Linguistically Grounded Analysis of Language Models using Shapley Head Values'. Sammen danner de et unikt fingeraftryk.

Citationsformater