Abstract
A study of temporal aspects of authorship attribution - a task which aims to distinguish automatically between texts written by different authors by measuring textual features. This task is important in a number of areas, including plagiarism detection in secondary education, which we study in this work. As the academic abilities of students evolve during their studies, so does their writing style. These changes in writing style form a type of temporal context, which we study for the authorship attribution process by focussing on the students’ more recent writing samples. Experiments with real world data from Danish secondary school students show 84% prediction accuracy when using all available material and 71.9% prediction accuracy when using only the five most recent writing samples from each student.
Originalsprog | Engelsk |
---|---|
Titel | Multidisciplinary Information Retrieval : 7th Information Retrieval Facility Conference, IRFC 2014, Copenhagen, Denmark, November 10-12, 2014, Proceedings |
Redaktører | David Lamas, Paul Buitelaar |
Antal sider | 19 |
Forlag | Springer |
Publikationsdato | 1 jan. 2014 |
Sider | 22-40 |
ISBN (Trykt) | 978-3-319-12978-5 |
ISBN (Elektronisk) | 978-3-319-12979-2 |
DOI | |
Status | Udgivet - 1 jan. 2014 |
Begivenhed | The 3rd Open Interdisciplinary MUMIA Conference and 7th Information Retrieval Facility Conference - Aalborg University Copenhagen, Copenhagen, Danmark Varighed: 11 nov. 2014 → 12 nov. 2014 Konferencens nummer: 7 |
Konference
Konference | The 3rd Open Interdisciplinary MUMIA Conference and 7th Information Retrieval Facility Conference |
---|---|
Nummer | 7 |
Lokation | Aalborg University Copenhagen |
Land/Område | Danmark |
By | Copenhagen |
Periode | 11/11/2014 → 12/11/2014 |
Navn | Lecture Notes in Computer Science |
---|---|
Vol/bind | 8849 |
ISSN | 0302-9743 |