Projekter pr. år
Abstract
Datasets play a central role in scholarly communications. However, scholarly graphs are often incomplete, particularly due to the lack of connections between publications and datasets. Therefore, the importance of dataset recommendation—identifying relevant datasets for a scientific paper, an author, or a textual query—is increasing. Although various methods have been proposed for this task, their reproducibility remains unexplored, making it difficult to compare them with new approaches. We reviewed current recommendation methods for scientific datasets, focusing on the most recent and competitive approaches, including an SVM-based model, a bi-encoder retriever, a method leveraging co-authors and citation network embeddings, and a heterogeneous variational graph autoencoder. These approaches underwent a comprehensive analysis under consistent experimental conditions. Our reproducibility efforts show that three methods can be reproduced, while the graph variational autoencoder is challenging due to unavailable code and test datasets. Hence, we re-implemented this method and performed a component-based analysis to examine its strengths and limitations. Furthermore, our study indicated that three out of four considered methods produce subpar results when applied to real-world data instead of specialized datasets with ad-hoc features.
Originalsprog | Engelsk |
---|---|
Titel | RecSys '24: Proceedings of the 18th ACM Conference on Recommender Systems |
Antal sider | 10 |
Udgivelsessted | New York, NY, USA |
Forlag | Association for Computing Machinery (ACM) |
Publikationsdato | okt. 2024 |
Sider | 570-579 |
ISBN (Elektronisk) | 9798400705052 |
DOI | |
Status | Udgivet - okt. 2024 |
Begivenhed | RecSys '24: 18th ACM Conference on Recommender Systems - Bari, Italien Varighed: 14 okt. 2024 → 18 okt. 2024 https://dl.acm.org/doi/proceedings/10.1145/3640457 |
Konference
Konference | RecSys '24: 18th ACM Conference on Recommender Systems |
---|---|
Land/Område | Italien |
By | Bari |
Periode | 14/10/2024 → 18/10/2024 |
Internetadresse |
Fingeraftryk
Dyk ned i forskningsemnerne om 'Reproducibility and Analysis of Scientific Dataset Recommendation Methods'. Sammen danner de et unikt fingeraftryk.Projekter
- 1 Igangværende
-
HEREDITARY: Heterogeneous semantic data integration for the gut-brain interplay
Dell'Aglio, D. (PI (principal investigator)), Lissandrini, M. (Projektdeltager), Rodriguez, J. M. (Projektdeltager), Montoya, G. (Projektdeltager), Fabbian, L. (Projektdeltager) & Trudslev, F. M. (Projektdeltager)
01/01/2024 → 31/12/2027
Projekter: Projekt › Forskning