Data citation and the citation graph

Peter Buneman; Dennis Dosso; Matteo Lissandrini; Gianmaria Silvello

doi:10.1162/qss_a_00166

Peter Buneman, Dennis Dosso^*, Matteo Lissandrini, Gianmaria Silvello

^*Corresponding author for this work

Research output: Contribution to journal › Journal article › Research › peer-review

Abstract

The citation graph is a computational artifact that is widely used to represent the domain of published literature. It represents connections between published works, such as citations and authorship. Among other things, the graph supports the computation of bibliometric measures such as h-indexes and impact factors. There is now an increasing demand that we should treat the publication of data in the same way that we treat conventional publications. In particular, we should cite data for the same reasons that we cite other publications.

In this paper we discuss what is needed for the citation graph to represent data citation. We identify two challenges: (i) to model the evolution of credit appropriately (through references) over time and (ii) to model data citation not only to a dataset treated as a single object but also to parts of it. We describe an extension of the current citation graph model that addresses these challenges. It is built on two central concepts: citable units and reference subsumption. We discuss how this extension would enable data citation to be represented within the citation graph and how it allows for improvements in current practices for bibliometric computations both for scientific publications and for data.
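As a rough illustration of the ideas named in the abstract, the sketch below counts citations over a tiny citation graph and computes an h-index from those counts. The `part_of` mapping and the rule that a citation to a part of a dataset is also credited to the whole dataset are assumptions made here for illustration, one plausible reading of "citable units" and "reference subsumption", not the model defined in the paper. All identifiers and the toy data are hypothetical.

```python
from collections import defaultdict


def citation_counts(edges, part_of=None):
    """Count distinct citing works per cited unit.

    edges: iterable of (citing, cited) pairs.
    part_of: optional, hypothetical mapping child -> parent unit. When given,
    a citation to a child is also credited to each of its ancestors
    (an assumed reading of reference subsumption, not the paper's definition).
    """
    part_of = part_of or {}
    citers = defaultdict(set)
    for citing, cited in edges:
        unit, seen = cited, set()
        # Walk up the containment chain; `seen` guards against cycles.
        while unit is not None and unit not in seen:
            seen.add(unit)
            citers[unit].add(citing)
            unit = part_of.get(unit)
    return {unit: len(who) for unit, who in citers.items()}


def h_index(counts):
    """Largest h such that at least h items have h or more citations."""
    h = 0
    for rank, count in enumerate(sorted(counts, reverse=True), start=1):
        if count < rank:
            break
        h = rank
    return h


# Toy graph: papers p1..p3 cite a dataset "d", two of its tables, and a paper "q".
edges = [
    ("p1", "d/table1"),
    ("p2", "d/table2"),
    ("p3", "d"),
    ("p1", "q"),
    ("p2", "q"),
]
part_of = {"d/table1": "d", "d/table2": "d"}

flat = citation_counts(edges)             # parts and whole counted separately
rolled = citation_counts(edges, part_of)  # citations to parts credited to "d"

# h-index of a hypothetical author credited with "d" and "q".
h_flat = h_index([flat["d"], flat["q"]])
h_rolled = h_index([rolled["d"], rolled["q"]])
```

In this toy example the dataset "d" receives one citation when parts are counted separately and three when citations to its tables are rolled up, which is enough to change the h-index computed for whoever is credited with "d" and "q".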

Original language	English
Journal	Quantitative Science Studies
Volume	2
Issue number	4
Pages (from-to)	1399-1422
Number of pages	24
DOIs	https://doi.org/10.1162/qss_a_00166
Publication status	Published - 4 Feb 2022

Bibliographical note

Funding Information:
The work was partially supported by the ExaMode project, as part of the European Union H2020 program under Grant Agreement No. 825292. Matteo Lissandrini is supported by the European Union H2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No. 838216.

Publisher Copyright:
© 2021 Peter Buneman, Dennis Dosso, Matteo Lissandrini, and Gianmaria Silvello.

Keywords

Bibliometrics
Citation graph
Data citation

Access to Document

10.1162/qss_a_00166

Open Access article. Final published version, 1.45 MB. Licence: CC BY 4.0

Cite this

@article{9c2ff9f880994e9d9683b7ccd1b64e85,

title = "Data citation and the citation graph",

abstract = "The citation graph is a computational artifact that is widely used to represent the domain of published literature. It represents connections between published works, such as citations and authorship. Among other things, the graph supports the computation of bibliometric measures such as h-indexes and impact factors. There is now an increasing demand that we should treat the publication of data in the same way that we treat conventional publications. In particular, we should cite data for the same reasons that we cite other publications. In this paper we discuss what is needed for the citation graph to represent data citation. We identify two challenges: (i) to model the evolution of credit appropriately (through references) over time and (ii) to model data citation not only to a dataset treated as a single object but also to parts of it. We describe an extension of the current citation graph model that addresses these challenges. It is built on two central concepts: citable units and reference subsumption. We discuss how this extension would enable data citation to be represented within the citation graph and how it allows for improvements in current practices for bibliometric computations both for scientific publications and for data.",

keywords = "Bibliometrics, Citation graph, Data citation",

author = "Peter Buneman and Dennis Dosso and Matteo Lissandrini and Gianmaria Silvello",

note = "Funding Information: The work was partially supported by the ExaMode project, as part of the European Union H2020 program under Grant Agreement No. 825292. Matteo Lissandrini is supported by the European Union H2020 research and innovation program under the Marie Sk{\l}odowska-Curie grant agreement No. 838216. Publisher Copyright: {\textcopyright} 2021 Peter Buneman, Dennis Dosso, Matteo Lissandrini, and Gianmaria Silvello.",

year = "2022",

month = feb,

day = "4",

doi = "10.1162/qss_a_00166",

language = "English",

volume = "2",

pages = "1399--1422",

journal = "Quantitative Science Studies",

issn = "2641-3337",

publisher = "MIT Press",

number = "4",

}

TY - JOUR

T1 - Data citation and the citation graph

AU - Buneman, Peter

AU - Dosso, Dennis

AU - Lissandrini, Matteo

AU - Silvello, Gianmaria

N1 - Funding Information: The work was partially supported by the ExaMode project, as part of the European Union H2020 program under Grant Agreement No. 825292. Matteo Lissandrini is supported by the European Union H2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No. 838216. Publisher Copyright: © 2021 Peter Buneman, Dennis Dosso, Matteo Lissandrini, and Gianmaria Silvello.

PY - 2022/2/4

Y1 - 2022/2/4

N2 - The citation graph is a computational artifact that is widely used to represent the domain of published literature. It represents connections between published works, such as citations and authorship. Among other things, the graph supports the computation of bibliometric measures such as h-indexes and impact factors. There is now an increasing demand that we should treat the publication of data in the same way that we treat conventional publications. In particular, we should cite data for the same reasons that we cite other publications. In this paper we discuss what is needed for the citation graph to represent data citation. We identify two challenges: (i) to model the evolution of credit appropriately (through references) over time and (ii) to model data citation not only to a dataset treated as a single object but also to parts of it. We describe an extension of the current citation graph model that addresses these challenges. It is built on two central concepts: citable units and reference subsumption. We discuss how this extension would enable data citation to be represented within the citation graph and how it allows for improvements in current practices for bibliometric computations both for scientific publications and for data.

AB - The citation graph is a computational artifact that is widely used to represent the domain of published literature. It represents connections between published works, such as citations and authorship. Among other things, the graph supports the computation of bibliometric measures such as h-indexes and impact factors. There is now an increasing demand that we should treat the publication of data in the same way that we treat conventional publications. In particular, we should cite data for the same reasons that we cite other publications. In this paper we discuss what is needed for the citation graph to represent data citation. We identify two challenges: (i) to model the evolution of credit appropriately (through references) over time and (ii) to model data citation not only to a dataset treated as a single object but also to parts of it. We describe an extension of the current citation graph model that addresses these challenges. It is built on two central concepts: citable units and reference subsumption. We discuss how this extension would enable data citation to be represented within the citation graph and how it allows for improvements in current practices for bibliometric computations both for scientific publications and for data.

KW - Bibliometrics

KW - Citation graph

KW - Data citation

UR - http://www.scopus.com/inward/record.url?scp=85124395310&partnerID=8YFLogxK

U2 - 10.1162/qss_a_00166

DO - 10.1162/qss_a_00166

M3 - Journal article

AN - SCOPUS:85124395310

SN - 2641-3337

VL - 2

SP - 1399

EP - 1422

JO - Quantitative Science Studies

JF - Quantitative Science Studies

IS - 4

ER -
