Relation Extraction between Medical Entities using Deep Learning Approach
DOI: https://doi.org/10.31449/inf.v45i3.3056
Abstract
Medical discharge summaries and patient prescriptions contain a variety of medical terms. Extracting the semantic relations between these terms is essential for discovering significant medical knowledge, and relation classification is one of the key tasks in biomedical information extraction. Automatic identification of relations between diseases, tests and treatments can improve the quality of patient care. This paper presents a deep learning based system for relation extraction between medical entities, in which a convolutional neural network is used for relation classification. The system is divided into four modules: word embedding, feature extraction, convolution and a softmax classifier. Its output is the set of classified relations between medical entities. The data set provided by the i2b2 2010 challenge is used for relation detection; it contains a total of 9070 relations in the test data and 5262 relations in the training data. Performance on the relation extraction task is evaluated using precision and recall. The system achieved an average precision of 75% and an average recall of 72%. Its performance is compared with that of the awarded systems that participated in the i2b2 challenge.
Keywords: Convolution Neural Network; Feature Extraction; Relation Classification; Word Embedding
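The four modules named in the abstract can be illustrated with a minimal, untrained sketch. The abstract gives no dimensions, feature definitions or label names, so everything below is an assumption made for illustration: the constants, the helper names, the relation labels, and the choice of signed token distances to the two entities as the extracted features. The weights are random, so the output shows only the shape of the computation, not a meaningful prediction.

```python
import math
import random

random.seed(0)  # fixed seed so the toy weights are reproducible

# Hypothetical toy settings; the abstract does not state the real ones.
EMB_DIM = 4      # word-embedding size
WINDOW = 3       # convolution window, in tokens
N_FILTERS = 5    # number of convolution filters
RELATIONS = ["treatment-problem", "test-problem", "problem-problem", "none"]


def rand_vec(n):
    """Return n small random weights."""
    return [random.uniform(-0.5, 0.5) for _ in range(n)]


def embed(tokens, table):
    """Module 1: look up (or create) an embedding vector for each token."""
    return [table.setdefault(t, rand_vec(EMB_DIM)) for t in tokens]


def add_position_features(vectors, e1, e2):
    """Module 2 (assumed form): append each token's signed distance to both entities."""
    return [v + [float(i - e1), float(i - e2)] for i, v in enumerate(vectors)]


def convolve_and_pool(features, filters):
    """Module 3: apply each filter to every WINDOW-token span, then max-pool."""
    pooled = []
    for w in filters:
        scores = []
        for start in range(len(features) - WINDOW + 1):
            span = [x for vec in features[start:start + WINDOW] for x in vec]
            scores.append(math.tanh(sum(a * b for a, b in zip(w, span))))
        pooled.append(max(scores))
    return pooled


def softmax_classify(pooled, weights):
    """Module 4: linear layer followed by a numerically stable softmax."""
    logits = [sum(a * b for a, b in zip(row, pooled)) for row in weights]
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(tokens, e1, e2):
    """Run the four modules on one sentence with entities at indices e1 and e2."""
    feat_dim = EMB_DIM + 2  # embedding plus two position features
    filters = [rand_vec(WINDOW * feat_dim) for _ in range(N_FILTERS)]
    weights = [rand_vec(N_FILTERS) for _ in RELATIONS]
    feats = add_position_features(embed(tokens, {}), e1, e2)
    probs = softmax_classify(convolve_and_pool(feats, filters), weights)
    return dict(zip(RELATIONS, probs))
```

Calling `classify("the patient was given aspirin for chest pain".split(), 4, 6)` returns one probability per label, and the probabilities sum to 1. The sentence must contain at least `WINDOW` tokens. A trained system would learn the embeddings, filters and classifier weights from the annotated relations instead of sampling them.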
License
Copyright © Slovenian Society Informatika