Recurrent deep network models for clinical NLP tasks: Use case with sentence boundary disambiguation

Benjamin C. Knoll, Elizabeth A. Lindemann, Arian L. Albert, Genevieve B. Melton, Serguei V.S. Pakhomov

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Scopus citations

Abstract

Although a number of foundational natural language processing (NLP) tasks like text segmentation are considered a simple problem in the general English domain dominated by well-formed text, complexities of clinical documentation lead to poor performance of existing solutions designed for the general English domain. We present an alternative solution that relies on a convolutional neural network layer followed by a bidirectional long short-term memory layer (CNN-Bi-LSTM) for the task of sentence boundary disambiguation and describe an ensemble approach for domain adaptation using two training corpora. Implementations using the Keras neural-networks API are available at https://github.com/NLPIE/clinical-sentences.

Original languageEnglish (US)
Title of host publicationMEDINFO 2019
Subtitle of host publicationHealth and Wellbeing e-Networks for All - Proceedings of the 17th World Congress on Medical and Health Informatics
EditorsBrigitte Seroussi, Lucila Ohno-Machado, Lucila Ohno-Machado, Brigitte Seroussi
PublisherIOS Press
Pages198-202
Number of pages5
Volume264
ISBN (Electronic)9781643680026
DOIs
StatePublished - Aug 21 2019
Event17th World Congress on Medical and Health Informatics, MEDINFO 2019 - Lyon, France
Duration: Aug 25 2019Aug 30 2019

Publication series

NameStudies in health technology and informatics
ISSN (Print)0926-9630

Conference

Conference17th World Congress on Medical and Health Informatics, MEDINFO 2019
Country/TerritoryFrance
CityLyon
Period8/25/198/30/19

Bibliographical note

Funding Information:
We would like to acknowledge Michael Hietpas, one of the annotators for our data. This research was supported in part by NIH/NCATS UL1TR002494, NIH/NCATS U01TR002062, NIH/NIGMS R01GM102282, and AHRQ R01HS022085. The content is solely the responsibility of the authors and does not necessary represent the official views of the National Institutes of Health or the Agency for Healthcare Research and Quality.

Publisher Copyright:
© 2019 International Medical Informatics Association (IMIA) and IOS Press. This article is published online with Open Access by IOS Press and distributed under the terms of the Creative Commons Attribution Non-Commercial License 4.0 (CC BY-NC 4.0).

Keywords

  • Machine Learning
  • Natural Language Processing
  • Neural Networks (Computer)
  • Neural Networks, Computer
  • Language
  • Documentation

PubMed: MeSH publication types

  • Journal Article

Fingerprint

Dive into the research topics of 'Recurrent deep network models for clinical NLP tasks: Use case with sentence boundary disambiguation'. Together they form a unique fingerprint.

Cite this