Teaching Analytics Medical-Data Common Sense

Tomer Sagi; Nitzan Shmueli; Bruce Friedman; Ruth Bergman

doi:10.1007/978-3-030-71055-2_14

Teaching Analytics Medical-Data Common Sense

Tomer Sagi^*, Nitzan Shmueli, Bruce Friedman, Ruth Bergman

^*Corresponding author for this work

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Abstract

The availability of Electronic Medical Records (EMR) has spawned the development of analytics designed to assist caregivers in monitoring, diagnosis, and treatment of patients. The long-term adoption of these tools hinges upon caregivers’ confidence in them, and subsequently, their robustness to data anomalies. Unfortunately, both complex machine-learning-based tools, which require copious amounts of data to train, and a simple trend graph presented in a patient-centered dashboard, may be sensitive to noisy data. While a caregiver would dismiss a heart rate of 2000, a medical analytic relying on it may fail or mislead its users. Developers should endow their systems with medical-data common sense to shield them from improbable values. To effectively do so, they require the ability to identify them. We motivate the need to teach analytics common sense by evaluating how anomalies impact visual-analytics, score-based sepsis-analytics SOFA and qSOFA, and a machine-learning-based sepsis predictor. We then describe the anomalous patterns designers should look for in medical data using a popular public medical research database - MIMIC-III. For each data type, we highlight methods to find these patterns. For numerical data, statistical methods are limited to high-throughput scenarios and large aggregations. Since deployed analytics monitor a single patient and must rely on a limited amount of data, rule-based methods are needed. In light of the dearth of medical guidelines to support such systems, we outline the dimensions upon which they should be defined upon.

Original language	English
Title of host publication	Heterogeneous Data Management, Polystores, and Analytics for Healthcare : VLDB Workshops, Poly 2020 and DMAH 2020, Virtual Event, August 31 and September 4, 2020, Revised Selected Papers
Publisher	Springer
Publication date	4 Mar 2021
Pages	171-187
ISBN (Print)	978-3-030-71054-5
DOIs	https://doi.org/10.1007/978-3-030-71055-2_14
Publication status	Published - 4 Mar 2021
Event	VLDB Workshop on Data Management and Analytics for Medicine and Healthcare - Online Duration: 4 Sept 2020 → 4 Sept 2020

Conference

Conference	VLDB Workshop on Data Management and Analytics for Medicine and Healthcare
Location	Online
Period	04/09/2020 → 04/09/2020

Series	Lecture Notes in Computer Science (LNCS)
Volume	12633
ISSN	0302-9743

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1007/978-3-030-71055-2_14

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

Sagi, Tomer ; Shmueli, Nitzan ; Friedman, Bruce et al. / Teaching Analytics Medical-Data Common Sense. Heterogeneous Data Management, Polystores, and Analytics for Healthcare: VLDB Workshops, Poly 2020 and DMAH 2020, Virtual Event, August 31 and September 4, 2020, Revised Selected Papers. Springer, 2021. pp. 171-187 (Lecture Notes in Computer Science (LNCS), Vol. 12633).

@inproceedings{da0c352cd0af41e3a3b25d228fe34744,

title = "Teaching Analytics Medical-Data Common Sense",

abstract = "The availability of Electronic Medical Records (EMR) has spawned the development of analytics designed to assist caregivers in monitoring, diagnosis, and treatment of patients. The long-term adoption of these tools hinges upon caregivers{\textquoteright} confidence in them, and subsequently, their robustness to data anomalies. Unfortunately, both complex machine-learning-based tools, which require copious amounts of data to train, and a simple trend graph presented in a patient-centered dashboard, may be sensitive to noisy data. While a caregiver would dismiss a heart rate of 2000, a medical analytic relying on it may fail or mislead its users. Developers should endow their systems with medical-data common sense to shield them from improbable values. To effectively do so, they require the ability to identify them. We motivate the need to teach analytics common sense by evaluating how anomalies impact visual-analytics, score-based sepsis-analytics SOFA and qSOFA, and a machine-learning-based sepsis predictor. We then describe the anomalous patterns designers should look for in medical data using a popular public medical research database - MIMIC-III. For each data type, we highlight methods to find these patterns. For numerical data, statistical methods are limited to high-throughput scenarios and large aggregations. Since deployed analytics monitor a single patient and must rely on a limited amount of data, rule-based methods are needed. In light of the dearth of medical guidelines to support such systems, we outline the dimensions upon which they should be defined upon.",

author = "Tomer Sagi and Nitzan Shmueli and Bruce Friedman and Ruth Bergman",

year = "2021",

month = mar,

day = "4",

doi = "10.1007/978-3-030-71055-2_14",

language = "English",

isbn = "978-3-030-71054-5",

series = "Lecture Notes in Computer Science (LNCS)",

publisher = "Springer",

pages = "171--187",

booktitle = "Heterogeneous Data Management, Polystores, and Analytics for Healthcare",

address = "Germany",

note = "VLDB Workshop on Data Management and Analytics for Medicine and Healthcare, DMAH 2020 ; Conference date: 04-09-2020 Through 04-09-2020",

}

Sagi, T, Shmueli, N, Friedman, B & Bergman, R 2021, Teaching Analytics Medical-Data Common Sense. in Heterogeneous Data Management, Polystores, and Analytics for Healthcare: VLDB Workshops, Poly 2020 and DMAH 2020, Virtual Event, August 31 and September 4, 2020, Revised Selected Papers. Springer, Lecture Notes in Computer Science (LNCS), vol. 12633, pp. 171-187, VLDB Workshop on Data Management and Analytics for Medicine and Healthcare, 04/09/2020. https://doi.org/10.1007/978-3-030-71055-2_14

Teaching Analytics Medical-Data Common Sense. / Sagi, Tomer; Shmueli, Nitzan; Friedman, Bruce et al.
Heterogeneous Data Management, Polystores, and Analytics for Healthcare: VLDB Workshops, Poly 2020 and DMAH 2020, Virtual Event, August 31 and September 4, 2020, Revised Selected Papers. Springer, 2021. p. 171-187 (Lecture Notes in Computer Science (LNCS), Vol. 12633).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

TY - GEN

T1 - Teaching Analytics Medical-Data Common Sense

AU - Sagi, Tomer

AU - Shmueli, Nitzan

AU - Friedman, Bruce

AU - Bergman, Ruth

PY - 2021/3/4

Y1 - 2021/3/4

N2 - The availability of Electronic Medical Records (EMR) has spawned the development of analytics designed to assist caregivers in monitoring, diagnosis, and treatment of patients. The long-term adoption of these tools hinges upon caregivers’ confidence in them, and subsequently, their robustness to data anomalies. Unfortunately, both complex machine-learning-based tools, which require copious amounts of data to train, and a simple trend graph presented in a patient-centered dashboard, may be sensitive to noisy data. While a caregiver would dismiss a heart rate of 2000, a medical analytic relying on it may fail or mislead its users. Developers should endow their systems with medical-data common sense to shield them from improbable values. To effectively do so, they require the ability to identify them. We motivate the need to teach analytics common sense by evaluating how anomalies impact visual-analytics, score-based sepsis-analytics SOFA and qSOFA, and a machine-learning-based sepsis predictor. We then describe the anomalous patterns designers should look for in medical data using a popular public medical research database - MIMIC-III. For each data type, we highlight methods to find these patterns. For numerical data, statistical methods are limited to high-throughput scenarios and large aggregations. Since deployed analytics monitor a single patient and must rely on a limited amount of data, rule-based methods are needed. In light of the dearth of medical guidelines to support such systems, we outline the dimensions upon which they should be defined upon.

AB - The availability of Electronic Medical Records (EMR) has spawned the development of analytics designed to assist caregivers in monitoring, diagnosis, and treatment of patients. The long-term adoption of these tools hinges upon caregivers’ confidence in them, and subsequently, their robustness to data anomalies. Unfortunately, both complex machine-learning-based tools, which require copious amounts of data to train, and a simple trend graph presented in a patient-centered dashboard, may be sensitive to noisy data. While a caregiver would dismiss a heart rate of 2000, a medical analytic relying on it may fail or mislead its users. Developers should endow their systems with medical-data common sense to shield them from improbable values. To effectively do so, they require the ability to identify them. We motivate the need to teach analytics common sense by evaluating how anomalies impact visual-analytics, score-based sepsis-analytics SOFA and qSOFA, and a machine-learning-based sepsis predictor. We then describe the anomalous patterns designers should look for in medical data using a popular public medical research database - MIMIC-III. For each data type, we highlight methods to find these patterns. For numerical data, statistical methods are limited to high-throughput scenarios and large aggregations. Since deployed analytics monitor a single patient and must rely on a limited amount of data, rule-based methods are needed. In light of the dearth of medical guidelines to support such systems, we outline the dimensions upon which they should be defined upon.

U2 - 10.1007/978-3-030-71055-2_14

DO - 10.1007/978-3-030-71055-2_14

M3 - Article in proceeding

SN - 978-3-030-71054-5

T3 - Lecture Notes in Computer Science (LNCS)

SP - 171

EP - 187

BT - Heterogeneous Data Management, Polystores, and Analytics for Healthcare

PB - Springer

T2 - VLDB Workshop on Data Management and Analytics for Medicine and Healthcare

Y2 - 4 September 2020 through 4 September 2020

ER -

Sagi T, Shmueli N, Friedman B, Bergman R. Teaching Analytics Medical-Data Common Sense. In Heterogeneous Data Management, Polystores, and Analytics for Healthcare: VLDB Workshops, Poly 2020 and DMAH 2020, Virtual Event, August 31 and September 4, 2020, Revised Selected Papers. Springer. 2021. p. 171-187. (Lecture Notes in Computer Science (LNCS), Vol. 12633). doi: 10.1007/978-3-030-71055-2_14

Teaching Analytics Medical-Data Common Sense

Abstract

Conference

UN SDGs

Access to Document

AUB Link

Fingerprint

Cite this