SIGTYP 2020 Shared Task: Prediction of Typological Features

Johannes Bjerva; Elizabeth Salesky; Sabrina J. Mielke; Aditi Chaudhary; Celano Giuseppe; Edoardo Maria Ponti; Ekaterina Vylomova; Ryan Cotterell; Isabelle Augenstein

doi:10.18653/v1/2020.sigtyp-1.1

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Abstract

Typological knowledge bases (KBs) such as WALS (Dryer and Haspelmath, 2013) contain information about linguistic properties of the world’s languages. They have been shown to be useful for downstream applications, including cross-lingual transfer learning and linguistic probing. A major drawback hampering broader adoption of typological KBs is that they are sparsely populated, in the sense that most languages only have annotations for some features, and skewed, in that few features have wide coverage. As typological features often correlate with one another, it is possible to predict them and thus automatically populate typological KBs, which is also the focus of this shared task. Overall, the task attracted 8 submissions from 5 teams, out of which the most successful methods make use of such feature correlations. However, our error analysis reveals that even the strongest submitted systems struggle with predicting feature values for languages where few features are known.
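The idea of predicting a missing feature from correlated ones can be illustrated with a small sketch. This is not one of the submitted systems: it is a minimal co-occurrence voting baseline written for illustration, and the knowledge base `TOY_KB`, its language names, feature names, and values are all made up (they are not real WALS entries). The function name `predict_feature` is likewise hypothetical.

```python
from collections import Counter

# Toy, made-up knowledge base: language -> {feature: value}.
# Sparse on purpose: most languages lack some features.
TOY_KB = {
    "lang_a": {"order_vo": "VO", "adposition": "prepositions",
               "order_gen_noun": "noun-genitive"},
    "lang_b": {"order_vo": "VO", "adposition": "prepositions"},
    "lang_c": {"order_vo": "OV", "adposition": "postpositions",
               "order_gen_noun": "genitive-noun"},
    "lang_d": {"order_vo": "OV", "adposition": "postpositions"},
    "lang_e": {"order_vo": "OV"},
}


def predict_feature(kb, language, target):
    """Guess kb[language][target] by voting over co-occurring feature values."""
    known = kb[language]
    votes = Counter()
    for other, feats in kb.items():
        if other == language or target not in feats:
            continue
        # One vote per feature value this language shares with the target language.
        shared = sum(1 for f, v in known.items()
                     if f != target and feats.get(f) == v)
        votes[feats[target]] += shared
    if votes and max(votes.values()) > 0:
        return votes.most_common(1)[0][0]
    # Fallback when no evidence is shared: most frequent value of the target feature.
    overall = Counter(f[target] for name, f in kb.items()
                      if name != language and target in f)
    return overall.most_common(1)[0][0] if overall else None


print(predict_feature(TOY_KB, "lang_e", "adposition"))
```

In this toy data, `lang_e` has a single known feature, so its prediction rests on one correlation only. That is the situation the abstract singles out as hard: the fewer features are known for a language, the less evidence a correlation-based predictor has to work with.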

Original language	English
Title of host publication	Proceedings of the Second Workshop on Computational Research in Linguistic Typology
Number of pages	11
Place of Publication	Online
Publisher	Association for Computational Linguistics
Publication date	1 Nov 2020
Pages	1-11
DOIs	https://doi.org/10.18653/v1/2020.sigtyp-1.1
Publication status	Published - 1 Nov 2020
Event	The Second Workshop on Computational Research in Linguistic Typology - Duration: 19 Nov 2020 → 20 Nov 2020

Workshop

Workshop	The Second Workshop on Computational Research in Linguistic Typology
Period	19/11/2020 → 20/11/2020

Keywords

Natural Language Processing

Access to Document

10.18653/v1/2020.sigtyp-1.1

Cite this

Bjerva, J., Salesky, E., Mielke, S. J., Chaudhary, A., Giuseppe, C., Ponti, E. M., Vylomova, E., Cotterell, R., & Augenstein, I. (2020). SIGTYP 2020 Shared Task: Prediction of Typological Features. In Proceedings of the Second Workshop on Computational Research in Linguistic Typology (pp. 1-11). Association for Computational Linguistics. https://doi.org/10.18653/v1/2020.sigtyp-1.1

@inproceedings{77651e7ae69e44b4bff2fd0ab055b5e1,
title = "SIGTYP 2020 Shared Task: Prediction of Typological Features",
keywords = "Natural Language Processing",
author = "Johannes Bjerva and Elizabeth Salesky and Mielke, {Sabrina J.} and Aditi Chaudhary and Celano Giuseppe and Ponti, {Edoardo Maria} and Ekaterina Vylomova and Ryan Cotterell and Isabelle Augenstein",
year = "2020",
month = nov,
day = "1",
doi = "10.18653/v1/2020.sigtyp-1.1",
language = "English",
pages = "1--11",
booktitle = "Proceedings of the Second Workshop on Computational Research in Linguistic Typology",
publisher = "Association for Computational Linguistics",
address = "United States",
note = "The Second Workshop on Computational Research in Linguistic Typology, SIGTYP 2020 ; Conference date: 19-11-2020 Through 20-11-2020",
}

TY - GEN
T1 - SIGTYP 2020 Shared Task: Prediction of Typological Features
T2 - The Second Workshop on Computational Research in Linguistic Typology
AU - Bjerva, Johannes
AU - Salesky, Elizabeth
AU - Mielke, Sabrina J.
AU - Chaudhary, Aditi
AU - Giuseppe, Celano
AU - Ponti, Edoardo Maria
AU - Vylomova, Ekaterina
AU - Cotterell, Ryan
AU - Augenstein, Isabelle
PY - 2020/11/1
Y1 - 2020/11/1
KW - Natural Language Processing
U2 - 10.18653/v1/2020.sigtyp-1.1
DO - 10.18653/v1/2020.sigtyp-1.1
M3 - Article in proceeding
SP - 1
EP - 11
BT - Proceedings of the Second Workshop on Computational Research in Linguistic Typology
PB - Association for Computational Linguistics
CY - Online
Y2 - 19 November 2020 through 20 November 2020
ER -
