Abstract
Authorship attribution (AA) is concerned with the task of finding out about the true authorship of a disputed text based on a set of documents of known authorship. In this paper, we investigate the potential of Bootstrap Consensus Networks (BCN) – a novel approach to generate visualizations in stylometry by mapping similarities of authorial style between texts into the form of a network – for large-scale authorship attribution tasks. We apply this method to the freedom of the press writings (Trykkefrihedsskrifter), a corpus of pamphlets published and collected in Denmark at the end of the 18th century. By conducting multiple experiments, we find that the size of the constructed networks depends heavily on the type of variables and distance measures used. Furthermore, we find that, although a small set of unknown authorship problems can be solved, in general, the precision of the BCN method is too low to apply it in a large-scale AA scenario.
Original language | English |
---|---|
Journal | CEUR Workshop Proceedings |
Volume | 2612 |
Pages (from-to) | 110-124 |
Number of pages | 15 |
ISSN | 1613-0073 |
Publication status | Published - 1 Jan 2020 |
Event | 5th Conference on Digital Humanities in the Nordic Countries, DHN 2020 - Riga, Latvia Duration: 21 Oct 2020 → 23 Oct 2020 |
Conference
Conference | 5th Conference on Digital Humanities in the Nordic Countries, DHN 2020 |
---|---|
Country/Territory | Latvia |
City | Riga |
Period | 21/10/2020 → 23/10/2020 |
Keywords
- Authorship Attribution
- Bootstrap Consensus Network (BCN)
- Stylometry