Assigning Diagnosis Codes Using Medication History

Emil Riis Hansen; Tomer Sagi; Katja Hose; Gregory Y. H. Lip; Torben Bjerregaard Larsen; Flemming Skjøth

doi:10.1016/j.artmed.2022.102307

Assigning Diagnosis Codes Using Medication History

Emil Riis Hansen^*, Tomer Sagi, Katja Hose^*, Gregory Y. H. Lip, Torben Bjerregaard Larsen, Flemming Skjøth

^*Corresponding author for this work

Research output: Contribution to journal › Journal article › Research › peer-review

5 Citations (Scopus)

183 Downloads (Pure)

Abstract

Diagnosis assignment is the process of assigning disease codes to patients. Automatic diagnosis assignment has the potential to validate code assignments, correct erroneous codes, and register completion. Previous methods build on text-based techniques utilizing medical notes but are inapplicable in the absence of these notes. We propose using patients' medication data to assign diagnosis codes. We present a proof-of-concept study using medical data from an American dataset (MIMIC-III) and Danish nationwide registers to train a machine-learning-based model that predicts an extensive collection of diagnosis codes for multiple levels of aggregation over a disease hierarchy. We further suggest a specialized loss function designed to utilize the innate hierarchical nature of the disease hierarchy. We evaluate the proposed method on a subset of 567 disease codes. Moreover, we investigate the technique's generalizability and transferability by (1) training and testing models on the same subsets of disease codes over the two medical datasets and (2) training models on the American dataset while evaluating them on the Danish dataset, respectively. Results demonstrate the proposed method can correctly assign diagnosis codes on multiple levels of aggregation from the disease hierarchy over the American dataset with recall 70.0% and precision 69.48% for top-10 assigned codes; thereby being comparable to text-based techniques. Furthermore, the specialized loss function performs consistently better than the non-hierarchical state-of-the-art version. Moreover, results suggest the proposed method is language and dataset-agnostic, with initial indications of transferability over subsets of disease codes.

Translated title of the contribution	Tildeling af diagnosekoder ved hjælp af medicin historik
Original language	English
Article number	102307
Journal	Artificial Intelligence in Medicine
Volume	128
Issue number	1
Number of pages	11
ISSN	0933-3657
DOIs	https://doi.org/10.1016/j.artmed.2022.102307
Publication status	Published - Jun 2022

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1016/j.artmed.2022.102307Licence: CC BY 4.0

Assigning_Diagnosis_Codes_Using_medication_HistoryAccepted author manuscript, 1.46 MBLicence: CC BY 4.0
Assigning_Diagnosis_codes_Using_Medication_History_Published_VersionFinal published version, 1.08 MBLicence: CC BY 4.0

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

@article{12539cc96cce40f8aaec495549ce7abd,

title = "Assigning Diagnosis Codes Using Medication History",

abstract = "Diagnosis assignment is the process of assigning disease codes to patients. Automatic diagnosis assignment has the potential to validate code assignments, correct erroneous codes, and register completion. Previous methods build on text-based techniques utilizing medical notes but are inapplicable in the absence of these notes. We propose using patients' medication data to assign diagnosis codes. We present a proof-of-concept study using medical data from an American dataset (MIMIC-III) and Danish nationwide registers to train a machine-learning-based model that predicts an extensive collection of diagnosis codes for multiple levels of aggregation over a disease hierarchy. We further suggest a specialized loss function designed to utilize the innate hierarchical nature of the disease hierarchy. We evaluate the proposed method on a subset of 567 disease codes. Moreover, we investigate the technique's generalizability and transferability by (1) training and testing models on the same subsets of disease codes over the two medical datasets and (2) training models on the American dataset while evaluating them on the Danish dataset, respectively. Results demonstrate the proposed method can correctly assign diagnosis codes on multiple levels of aggregation from the disease hierarchy over the American dataset with recall 70.0% and precision 69.48% for top-10 assigned codes; thereby being comparable to text-based techniques. Furthermore, the specialized loss function performs consistently better than the non-hierarchical state-of-the-art version. Moreover, results suggest the proposed method is language and dataset-agnostic, with initial indications of transferability over subsets of disease codes.",

keywords = "Diagnosis and Classification, Patient Assessment, Classification Algorithms, Curriculum Learning, Medication History",

author = "Hansen, {Emil Riis} and Tomer Sagi and Katja Hose and Lip, {Gregory Y. H.} and Larsen, {Torben Bjerregaard} and Flemming Skj{\o}th",

year = "2022",

month = jun,

doi = "10.1016/j.artmed.2022.102307",

language = "English",

volume = "128",

journal = "Artificial Intelligence in Medicine",

issn = "0933-3657",

publisher = "Elsevier",

number = "1",

}

TY - JOUR

T1 - Assigning Diagnosis Codes Using Medication History

AU - Hansen, Emil Riis

AU - Sagi, Tomer

AU - Hose, Katja

AU - Lip, Gregory Y. H.

AU - Larsen, Torben Bjerregaard

AU - Skjøth, Flemming

PY - 2022/6

Y1 - 2022/6

N2 - Diagnosis assignment is the process of assigning disease codes to patients. Automatic diagnosis assignment has the potential to validate code assignments, correct erroneous codes, and register completion. Previous methods build on text-based techniques utilizing medical notes but are inapplicable in the absence of these notes. We propose using patients' medication data to assign diagnosis codes. We present a proof-of-concept study using medical data from an American dataset (MIMIC-III) and Danish nationwide registers to train a machine-learning-based model that predicts an extensive collection of diagnosis codes for multiple levels of aggregation over a disease hierarchy. We further suggest a specialized loss function designed to utilize the innate hierarchical nature of the disease hierarchy. We evaluate the proposed method on a subset of 567 disease codes. Moreover, we investigate the technique's generalizability and transferability by (1) training and testing models on the same subsets of disease codes over the two medical datasets and (2) training models on the American dataset while evaluating them on the Danish dataset, respectively. Results demonstrate the proposed method can correctly assign diagnosis codes on multiple levels of aggregation from the disease hierarchy over the American dataset with recall 70.0% and precision 69.48% for top-10 assigned codes; thereby being comparable to text-based techniques. Furthermore, the specialized loss function performs consistently better than the non-hierarchical state-of-the-art version. Moreover, results suggest the proposed method is language and dataset-agnostic, with initial indications of transferability over subsets of disease codes.

AB - Diagnosis assignment is the process of assigning disease codes to patients. Automatic diagnosis assignment has the potential to validate code assignments, correct erroneous codes, and register completion. Previous methods build on text-based techniques utilizing medical notes but are inapplicable in the absence of these notes. We propose using patients' medication data to assign diagnosis codes. We present a proof-of-concept study using medical data from an American dataset (MIMIC-III) and Danish nationwide registers to train a machine-learning-based model that predicts an extensive collection of diagnosis codes for multiple levels of aggregation over a disease hierarchy. We further suggest a specialized loss function designed to utilize the innate hierarchical nature of the disease hierarchy. We evaluate the proposed method on a subset of 567 disease codes. Moreover, we investigate the technique's generalizability and transferability by (1) training and testing models on the same subsets of disease codes over the two medical datasets and (2) training models on the American dataset while evaluating them on the Danish dataset, respectively. Results demonstrate the proposed method can correctly assign diagnosis codes on multiple levels of aggregation from the disease hierarchy over the American dataset with recall 70.0% and precision 69.48% for top-10 assigned codes; thereby being comparable to text-based techniques. Furthermore, the specialized loss function performs consistently better than the non-hierarchical state-of-the-art version. Moreover, results suggest the proposed method is language and dataset-agnostic, with initial indications of transferability over subsets of disease codes.

KW - Diagnosis and Classification

KW - Patient Assessment

KW - Classification Algorithms

KW - Curriculum Learning

KW - Medication History

UR - http://www.scopus.com/inward/record.url?scp=85129310671&partnerID=8YFLogxK

U2 - 10.1016/j.artmed.2022.102307

DO - 10.1016/j.artmed.2022.102307

M3 - Journal article

SN - 0933-3657

VL - 128

JO - Artificial Intelligence in Medicine

JF - Artificial Intelligence in Medicine

IS - 1

M1 - 102307

ER -

Assigning Diagnosis Codes Using Medication History

Abstract

UN SDGs

Access to Document

AUB Link

Other files and links

Poul Due Jensen Professorate in Big Data and Artificial Intelligence

Representing Health Data and Medical Knowledge for Deep Learning

Towards Assigning Diagnosis Codes Using Medication History

Cite this

Assigning Diagnosis Codes Using Medication History

Abstract

UN SDGs

Access to Document

AUB Link

Other files and links

Projects

Poul Due Jensen Professorate in Big Data and Artificial Intelligence

Research output

Representing Health Data and Medical Knowledge for Deep Learning

Towards Assigning Diagnosis Codes Using Medication History

Cite this