Abstract
Although a number of foundational natural language processing (NLP) tasks like text segmentation are considered a simple problem in the general English domain dominated by well-formed text, complexities of clinical documentation lead to poor performance of existing solutions designed for the general English domain. We present an alternative solution that relies on a convolutional neural network layer followed by a bidirectional long short-term memory layer (CNN-Bi-LSTM) for the task of sentence boundary disambiguation and describe an ensemble approach for domain adaptation using two training corpora. Implementations using the Keras neural-networks API are available at https://github.com/NLPIE/clinical-sentences.
Original language | English (US) |
---|---|
Title of host publication | MEDINFO 2019 |
Subtitle of host publication | Health and Wellbeing e-Networks for All - Proceedings of the 17th World Congress on Medical and Health Informatics |
Editors | Brigitte Seroussi, Lucila Ohno-Machado, Lucila Ohno-Machado, Brigitte Seroussi |
Publisher | IOS Press |
Pages | 198-202 |
Number of pages | 5 |
Volume | 264 |
ISBN (Electronic) | 9781643680026 |
DOIs | |
State | Published - Aug 21 2019 |
Event | 17th World Congress on Medical and Health Informatics, MEDINFO 2019 - Lyon, France Duration: Aug 25 2019 → Aug 30 2019 |
Publication series
Name | Studies in health technology and informatics |
---|---|
ISSN (Print) | 0926-9630 |
Conference
Conference | 17th World Congress on Medical and Health Informatics, MEDINFO 2019 |
---|---|
Country/Territory | France |
City | Lyon |
Period | 8/25/19 → 8/30/19 |
Bibliographical note
Funding Information:We would like to acknowledge Michael Hietpas, one of the annotators for our data. This research was supported in part by NIH/NCATS UL1TR002494, NIH/NCATS U01TR002062, NIH/NIGMS R01GM102282, and AHRQ R01HS022085. The content is solely the responsibility of the authors and does not necessary represent the official views of the National Institutes of Health or the Agency for Healthcare Research and Quality.
Publisher Copyright:
© 2019 International Medical Informatics Association (IMIA) and IOS Press. This article is published online with Open Access by IOS Press and distributed under the terms of the Creative Commons Attribution Non-Commercial License 4.0 (CC BY-NC 4.0).
Keywords
- Machine Learning
- Natural Language Processing
- Neural Networks (Computer)
- Neural Networks, Computer
- Language
- Documentation
PubMed: MeSH publication types
- Journal Article