Obesity Entity Extraction from Real Outpatient Records: When Learning-Based Methods Meet Small Imbalanced Medical Data Sets

Yihan Deng, Peter Dolog, Jörn-Markus Gass, Kerstin Denecke

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

1 Citation (Scopus)

Abstract

The postoperative health status of an obesity patient indicates the outcome of the surgical treatment. At each postoperative revisit, physicians need to go through the previous patient records to recall the patient's status and to evaluate the postoperative risk of readmission. To support this process, we develop a method to extract indicators and to analyse weight changes, so that potential complications and risks of clinical readmission can be recognized in a timely manner. In this paper, we compare two approaches, one based on traditional machine learning and one on neural networks. Aspects that refer to a change in health status or that are relevant to treatment are extracted from the outpatient medical records generated at each postoperative revisit. The performance of traditional machine learning on the task of obesity-related entity extraction is compared with one variation of attentive recurrent neural networks. The ensemble classifier of binary attentive bi-LSTMs, with data balancing using conditional generative adversarial networks (CGAN), achieved an F1 measure of 86.5% on the task of classifying eight classes of obesity-related entities. We conclude that, when processing a small data set with neural networks, a data balancing method should first be applied to obtain an extended corpus and a general representation, which can apparently increase the differentiability of the input data. Fine-tuning the networks can further enhance performance.
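To make the evaluation setting in the abstract concrete, the following minimal Python sketch shows one way an ensemble of per-class binary classifiers could be combined into a single multi-class decision, and how a per-class and macro-averaged F1 measure can be computed. Everything here is an assumption for illustration: the abstract does not state how the binary classifiers are aggregated (highest score wins is assumed), which F1 averaging is reported, or what the eight entity classes are called, so the function names and labels are hypothetical.

```python
from collections import Counter


def ensemble_predict(scores_per_class):
    """Return the label whose binary classifier gives the highest score.

    Assumption: one binary classifier per entity class, combined by
    taking the maximum score. The paper's actual rule is not stated.
    """
    return max(scores_per_class, key=scores_per_class.get)


def f1_per_class(gold, predicted):
    """Compute the F1 measure for each label from paired label lists."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted label was wrong
            fn[g] += 1  # gold label was missed
    result = {}
    for label in set(gold) | set(predicted):
        denom = 2 * tp[label] + fp[label] + fn[label]
        result[label] = 2 * tp[label] / denom if denom else 0.0
    return result


def macro_f1(gold, predicted):
    """Unweighted mean of the per-class F1 scores."""
    scores = f1_per_class(gold, predicted)
    return sum(scores.values()) / len(scores)


if __name__ == "__main__":
    # Hypothetical labels and scores, not taken from the paper.
    token_scores = [
        {"WEIGHT": 0.9, "COMPLICATION": 0.2, "MEDICATION": 0.1},
        {"WEIGHT": 0.3, "COMPLICATION": 0.6, "MEDICATION": 0.4},
    ]
    predicted = [ensemble_predict(s) for s in token_scores]
    gold = ["WEIGHT", "MEDICATION"]
    print(predicted, macro_f1(gold, predicted))
```

Macro averaging weights every class equally, which is one common choice for imbalanced label sets because rare classes are not drowned out by frequent ones.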

Original language: English
Title of host publication: Proceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019
Number of pages: 6
Publisher: IEEE
Publication date: 2019
Pages: 411-416
Article number: 8787533
ISBN (Print): 978-1-7281-2286-1
ISBN (Electronic): 9781728122861
Publication status: Published - 2019
Event: IEEE International Symposium on Computer-Based Medical Systems: CBMS - Cordoba, Spain
Duration: 5 Jun 2019 to 7 Jun 2019
Conference number: 32nd

Conference

Conference: IEEE International Symposium on Computer-Based Medical Systems
Number: 32nd
Country: Spain
City: Cordoba
Period: 05/06/2019 to 07/06/2019
Series: IEEE International Symposium on Computer-Based Medical Systems (CBMS)
ISSN: 2372-9198

Keywords

  • Bi LSTM
  • CGAN
  • Conditional random fields
  • Imbalanced data
  • Named entity recognition
  • Oversampling
  • Post operative obesity patient

