MultiplEYE: Creating a multilingual eye-tracking-while-reading corpus

Deborah Noemie Jakobi*, Maja Stegenwallner-Schütz, Nora Hollenstein, Cui Ding, Ramune Kaspere, Ana Matić Škorić, Eva Pavlinusic Vilus, Stefan Frank, Marie Luise Müller, Kristine M. Jensen de López, Nik Kharlamov, Hanne Bruun Søndergaard Knudsen, Yevgeni Berzak, Ella Lion, Irina Sekerina, Cengiz Acartürk, Mohd Faizan Ansari, Katarzyna Harężlak, Paweł Kasprowski, Ana Bautista, Lisa Beinborn, Anna Bondar, Antonia Boznou, Leah Bradshaw, Jana Mara Hofmann, Thyra Krosness, Not Battesta Soliva, Anila Çepani, Kristina Cergol, Ana Došen, Marijan Palmovic, Adelina Çerpja, Dalí Chirino, Jan Chromý, Vera Demberg, Iza Škrjanec, Nazik Dinçtopal Deniz, Inmaculada Fajardo, Mariola Giménez-Salvador, Xavier Mínguez-López, Maroš Filip, Zigmunds Freibergs, Jéssica Gomes, Andreia Janeiro, Paula Luegi, João Veríssimo, Sasho Gramatikov, Jana Hasenäcker, Alba Haveriku, Nelda Kote, Muhammad M. Kamal, Hanna Kędzierska, Dorota Klimek-Jankowska, Sara Kosutar, Daniel Krakowczyk, Izabela Krejtz, Marta Łockiewicz, Kaidi Lõo, Jurgita Motiejūnienė, Jamal A. Nasir, Johanne Sofie Nedergård, Ayşegül Özkan, Mikuláš Preininger, Loredana Pungă, David Reich, Chiara Tschirner, Špela Rot, Andreas Säuberli, Jordi Solé-Casals, Ekaterina Strati, Igor Svoboda, Evis Trandafili, Spyridoula Varlokosta, Mila Vulchanova, Lena Ann Jäger

*Corresponding author for this work

Research output: Contribution to book/anthology/report/conference proceeding > Article in proceeding > Research > peer-review

Abstract

Eye-tracking-while-reading data provide valuable insights across multiple disciplines, including psychology, linguistics, natural language processing, education, and human-computer interaction. Despite their potential, the availability of large, high-quality, multilingual datasets remains limited, hindering both foundational reading research and advancements in applications. The MultiplEYE project addresses this gap by establishing a large-scale, international eye-tracking data collection initiative. It aims to create a multilingual dataset of eye movements recorded during natural reading, balancing linguistic diversity while ensuring methodological consistency for reliable cross-linguistic comparisons. The dataset spans numerous languages and follows strict procedural, documentation, and data pre-processing standards to enhance eye-tracking data transparency and reproducibility. A novel data-sharing framework, integrated with data quality reports, allows for selective data filtering based on research needs. Researchers and labs worldwide are invited to join the initiative. By establishing and promoting standardized practices and open data sharing, MultiplEYE facilitates interdisciplinary research and advances reading research and gaze-augmented applications.
Original language: English
Title of host publication: ETRA '25: Proceedings of the 2025 ACM Symposium on Eye Tracking Research and Applications: May 26-29, 2025, Tokyo, Japan
Editors: Stephen N. Spencer
Number of pages: 11
Publisher: Association for Computing Machinery (ACM)
Publication date: 25 May 2025
Article number: 111
ISBN (Electronic): 9798400714870
Publication status: Published - 25 May 2025
Event: 17th ACM Symposium on Eye Tracking Research & Applications (ETRA 2025) - Tokyo, Japan
Duration: 26 May 2025 to 29 May 2025
https://etra.acm.org/2025/

Conference

Conference: 17th ACM Symposium on Eye Tracking Research & Applications (ETRA 2025)
Country/Territory: Japan
City: Tokyo
Period: 26/05/2025 to 29/05/2025
Series: Eye Tracking Research and Applications Symposium (ETRA)

Keywords

  • Eye Movement
  • Eye Tracking
  • Natural Language Processing (NLP)
  • Open Data
  • Psycholinguistics
  • Reading
  • open science
  • multilingual
