Projektdetaljer
Beskrivelse
Language is the key to accessing the modern technology on which our society relies, such as online search, spelling correction, and automatic translation. However, out of the over 7,000 languages in the world, only a handful have access to such technology. This is in part due to state-of-the-art solutions requiring vast amounts of data, which is unavailable to most languages, which can be referred to as resource-poor. Hence, most languages are marginalized in the current technological development, and will continue to be so unless fundamental changes are made. My project is about addressing this issue, by making use of the fact that languages often have systematic similarities with one another, aiming to increase technological access to billions of speakers of resource-poor languages.
Due to the project's interdisciplinary angle, and its focus on human language, it is uniquely positioned to have a substantial social impact. An example of its importance can be found in the UN's Sustainable Development Goals, which outlines a concrete aim in this direction (9.c.), looking to increase ICT access in least developed countries. However, simply providing ICT access will not have the impact imagined for speakers of resource-poor languages, as physical access to ICT does not equate to access to modern technologies. Real access requires fundamental changes in how we approach these languages. When successful, the findings of the project has the potential to impact the lives of billions of speakers of low-resource languages in third-world regions, but also domestically in terms of, e.g., Faroese. In short, providing people with access to language technologies in their native languages, will lead to increased life quality and equality across both languages and cultures in the world.
| Status | Igangværende |
|---|---|
| Effektiv start/slut dato | 01/09/2022 → 31/08/2026 |
Finansiering
- Google: 425.000,00 kr.
- Carlsbergfondet: 5.000.000,00 kr.
FN's verdensmål
I 2015 blev FN-landene enige om 17 verdensmål til at bekæmpe fattigdom, beskytte planeten og sikre velstand for alle. Dette projekt bidrager til følgende verdensmål:
-
Verdensmål 9 Industri, innovation og infrastruktur
-
Verdensmål 10 Mindre ulighed
Fingerprint
Aktiviteter
-
Linguistic Disparities in Language Technology: Lessons from West-Greenlandic
Ploeger, E. (Foredragsholder)
14 nov. 2025Aktivitet: Foredrag og mundtlige bidrag › Konferenceoplæg
-
Big Scandinavian Data and LLMs
Bjerva, J. (Arrangør), Frank, S. (Arrangør), Sanchez Villegas, D. (Arrangør), Schiavone, A. (Arrangør), Lavrinovics, E. (Arrangør) & Claus, H. (Arrangør)
27 aug. 2025Aktivitet: Deltagelse i faglig begivenhed › Organisering af eller deltagelse i workshop, kursus, seminar, udstilling eller lignende
-
Neural Networks
Fekete, M. R. (Foredragsholder)
4 aug. 2025 → 11 aug. 2025Aktivitet: Foredrag og mundtlige bidrag › Gæsteforelæsning
Priser
-
EliteForsk-rejsestipendium 2024
Chen, Y. (Modtager), 26 feb. 2024
Pris: Forsknings- uddannelses og innovationspriser
Publikation
-
Semantic Leakage from Image Embeddings
Chen, Y., Xu, Q., Elliott, D., Li, Q. & Bjerva, J., 2 feb. 2026, arXiv, 20 s.Publikation: Working paper/Preprint › Preprint
Åben adgang -
Towards Understanding Professional AI Assistant Use: Human-in-the-Loop Topic Modeling of Multi-turn Conversations
Chen, Y., Archasantisuk, S. & Bjerva, J., 1 feb. 2026, (Accepteret/In press) Companion Proceedings of the ACM Web Conference 2026: WWW Companion '26. Dubai, United Arab Emirates, 12 s.Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review
Åben adgang -
A Cross-Lingual Perspective on Neural Machine Translation Difficulty
Ploeger, E., Bjerva, J., Tiedemann, J. & Östling, R., nov. 2025, Proceedings of the Tenth Conference on Machine Translation. Association for Computational Linguistics (ACL), s. 340-354Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review
Åben adgangFil6 Downloads (Pure)