Personal profile

Research profile

I am a Postdoctoral Researcher in Natural Language Processing (NLP), focusing on NLP applications and, in particular, NLP for Education. I am advised by Prof. Johannes Bjerva and Prof. Euan Lindsay.

I hold a PhD in NLP from the IT University of Copenhagen (ITU), where I was advised by Prof. Barbara Plank and A/P. Rob van der Goot. I was part of NLPnorth at ITU and MaiNLP at the Ludwig Maximilian University of Munich (LMU). I worked on Computational Job Market Analysis (or NLP for HR), where we investigated how to extract information (e.g., skills) from job advertisements and match it to existing resources (e.g., taxonomies).

I am interested in:

  • NLP x Education (Postdoc): Can we improve students’ learning by giving them automatic feedback from NLP tools (e.g., language models)? How can we do this over time?
  • NLP x HR (PhD): How can we extract relevant skills from job ads, and how can we match them with existing taxonomies to help job centers match candidates to jobs more effectively?
  • Resource Creation: My general interests lie mostly in resource creation, such as developing annotation guidelines, creating (multilingual) datasets in both general and specific domains, and training language models at small and large scale.

Expertise related to the UN Sustainable Development Goals

In 2015, the UN member states agreed on 17 Sustainable Development Goals (SDGs) to fight poverty, protect the planet, and ensure prosperity for all. This person's work contributes to the following SDGs:

  • SDG 4 - Quality Education
  • SDG 8 - Decent Work and Economic Growth

Keywords

  • Computer Science

Collaborations in the last five years

  • All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

    Vayani, A., Dissanayake, D., Watawana, H., Ahsan, N., Sasikumar, N., Thawakar, O., Ademtew, H. B., Hmaiti, Y., Kumar, A., Kuckreja, K., Maslych, M., Ghallabi, W. A., Qin, C., Shaker, A. M., Zhang, M., Ihsani, M. K., Esplana, A., Gokani, M., Mirkin, S., Singh, H., Srivastava, A., Hamerlik, E., Izzati, F. A., Maani, F. A., Cavada, S., Chim, J., Gupta, R., Manjunath, S., Zhumakhanova, K., Rabevohitra, F. H., Amirudin, A., Ridzuan, M., Kareem, D., More, K., Li, K., Shakya, P., Saad, M., Ghasemaghaei, A., Djanibekov, A., Azizov, D., Jankovic, B., Bhatia, N., Obando-Ceron, J., Otieno, O., Farestam, F., Rabbani, M., Baliah, S., Sanjeev, S., Shtanchaev, A., Fatima, M., Nguyen, T., Kareem, A., Aremu, T., Xavier, N., Bhatkal, A., Toyin, H., Chadha, A., Cholakkal, H., Anwer, R. M., Felsberg, M., Laaksonen, J., Solorio, T., Choudhury, M., Laptev, I., Shah, M., Khan, S. & Khan, F., 2025, arXiv, 26 p.

    Publication: Working paper/Preprint; Preprint

    Open access
  • All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

    Vayani, A., Dissanayake, D., Watawana, H., Ahsan, N., Sasikumar, N., Thawakar, O., Ademtew, H. B., Hmaiti, Y., Kumar, A., Kuckreja, K., Maslych, M., Ghallabi, W. A., Qin, C., Shaker, A. M., Zhang, M., Ihsani, M. K., Esplana, A., Gokani, M., Mirkin, S., Singh, H., Srivastava, A., Hamerlik, E., Izzati, F. A., Maani, F. A., Cavada, S., Chim, J., Gupta, R., Manjunath, S., Zhumakhanova, K., Rabevohitra, F. H., Amirudin, A., Ridzuan, M., Kareem, D., More, K., Li, K., Shakya, P., Saad, M., Ghasemaghaei, A., Djanibekov, A., Azizov, D., Jankovic, B., Bhatia, N., Obando-Ceron, J., Otieno, O., Farestam, F., Rabbani, M., Baliah, S., Sanjeev, S., Shtanchaev, A., Fatima, M., Nguyen, T., Kareem, A., Aremu, T., Xavier, N., Bhatkal, A., Toyin, H., Chadha, A., Cholakkal, H., Anwer, R. M., Felsberg, M., Laaksonen, J., Solorio, T., Choudhury, M., Laptev, I., Shah, M., Khan, S. & Khan, F., 10 Jun 2025, 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (Institute of Electrical and Electronics Engineers), p. 19565-19575, 11 p., 11094031. (IEEE Conference on Computer Vision and Pattern Recognition. Proceedings).

    Publication: Contribution to book/anthology/report/conference proceeding; Conference article in proceedings; Research; Peer-reviewed

    1 Citation (Scopus)
    157 Downloads (Pure)
  • Cross-Lingual Sentence-Level Skill Identification in English and Danish Job Advertisements

    Musazade, N., Zhang, M. & Mezei, J., 3 Aug 2025, (Accepted/In press) International Conference on Natural Language and Speech Processing 2025. Odense, Denmark: Association for Computational Linguistics

    Publication: Contribution to book/anthology/report/conference proceeding; Conference article in proceedings; Research; Peer-reviewed

    Open access
    File
    35 Downloads (Pure)
  • DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers

    Müller-Eberstein, M., Zhang, M., Bassignana, E., Brunsgaard Trolle, P. & van der Goot, R., 29 Apr 2025, 3rd Workshop on Cross-Cultural Considerations in NLP (C3NLP). Association for Computational Linguistics

    Publication: Contribution to book/anthology/report/conference proceeding; Conference article in proceedings; Research; Peer-reviewed

    15 Downloads (Pure)
  • HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings

    Aavang, R. T., Rizzi, G., Bøggild, R., Iolov, A., Zhang, M. & Bjerva, J., Feb 2025, (Submitted).

    Publication: Working paper/Preprint; Preprint