Morphology Matters: Probing the Cross-linguistic Morphological Generalization Abilities of Large Language Models through a Wug Test

Anh Dang, Limor Raviv, Lukas Galke

Publication: Chapter in book/report/conference proceeding; Conference contribution in proceedings; Research; Peer-reviewed

Abstract

We develop a multilingual version of the Wug Test, an artificial word completion experiment that is typically used to test the morphological knowledge of children, and apply it to the GPT family of large language models (LLMs). LLMs’ performance on this test was evaluated by native speakers of six different languages, who judged whether the inflected and derived forms generated by the models conform to the morphological rules of their language. Our results show that LLMs can generalize their morphological knowledge to new, unfamiliar words, but that their success in generating the “correct” generalization (as judged by native human speakers) is predicted by a language’s morphological complexity (specifically, integrative complexity). We further find that the amount of training data has surprisingly little effect on LLMs’ morphological generalization abilities within the scope of the analyzed languages. These findings highlight that “morphology matters”, and have important implications for improving low-resource language modeling.

Original language: English
Title: CMCL 2024 - 13th Edition of the Workshop on Cognitive Modeling and Computational Linguistics, Proceedings of the Workshop
Editors: Tatsuki Kuribayashi, Giulia Rambelli, Ece Takmaz, Philipp Wicke, Yohei Oseki
Publisher: Association for Computational Linguistics (ACL)
Publication date: 2024
Pages: 177-188
ISBN (electronic): 9798891761438
Status: Published - 2024
Published externally: Yes
Event: 13th Edition of the Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2024 - Bangkok, Thailand
Duration: 15 Aug 2024 → …

Conference

Conference: 13th Edition of the Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2024
Country/Territory: Thailand
City: Bangkok
Period: 15/08/2024 → …
Sponsor: Japan Science and Technology Agency
Name: CMCL 2024 - 13th Edition of the Workshop on Cognitive Modeling and Computational Linguistics, Proceedings of the Workshop

Bibliographical note

Publisher Copyright:
©2024 Association for Computational Linguistics.
