SIGTYP 2020 Shared Task: Prediction of Typological Features

Johannes Bjerva, Elizabeth Salesky, Sabrina J. Mielke, Aditi Chaudhary, Celano Giuseppe, Edoardo Maria Ponti, Ekaterina Vylomova, Ryan Cotterell, Isabelle Augenstein

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Abstract

Typological knowledge bases (KBs) such as WALS (Dryer and Haspelmath, 2013) contain information about linguistic properties of the world’s languages. They have been shown to be useful for downstream applications, including cross-lingual transfer learning and linguistic probing. A major drawback hampering broader adoption of typological KBs is that they are sparsely populated, in the sense that most languages only have annotations for some features, and skewed, in that few features have wide coverage. As typological features often correlate with one another, it is possible to predict them and thus automatically populate typological KBs, which is also the focus of this shared task. Overall, the task attracted 8 submissions from 5 teams, out of which the most successful methods make use of such feature correlations. However, our error analysis reveals that even the strongest submitted systems struggle with predicting feature values for languages where few features are known.
Original languageEnglish
Title of host publicationProceedings of the Second Workshop on Computational Research in Linguistic Typology
Number of pages11
Place of PublicationOnline
PublisherAssociation for Computational Linguistics
Publication date1 Nov 2020
Pages1-11
DOIs
Publication statusPublished - 1 Nov 2020
EventThe Second Workshop on Computational Research in Linguistic Typology -
Duration: 19 Nov 202020 Nov 2020

Workshop

WorkshopThe Second Workshop on Computational Research in Linguistic Typology
Period19/11/202020/11/2020

Keywords

  • Natural Language Processing

Fingerprint

Dive into the research topics of 'SIGTYP 2020 Shared Task: Prediction of Typological Features'. Together they form a unique fingerprint.

Cite this