skip to main content
research-article
Free access
Just Accepted

DeepMedFeature: An Accurate Feature Extraction and Drug-Drug Interaction Model for Clinical Text in Medical Informatics

Online AM: 09 March 2024 Publication History

Abstract

Drug-drug interactions (DDIs) are an important biological phenomenon which can result in medical errors from medical practitioners. Drug interactions can change the molecular structure of interacting agents which may prove to be fatal in the worst case. Finding drug interactions early in diagnosis can be pivotal in side-effect prevention. The growth of big data provides a rich source of information for clinical studies to investigate DDIs. We propose a hierarchical classification model which is double-pass in nature. The first pass predicts the occurrence of an interaction and then the second pass further predicts the type of interaction such as effect, advice, mechanism, and int. We applied different deep learning algorithms with Convolutional Bi-LSTM (ConvBLSTM) proving to be the best. The results show that pre-trained vector embeddings prove to be the most appropriate features. The F1-score of the ConvBLSTM algorithm turned out to be 96.39% and 98.37% in Russian and English language respectively which is greater than the state-of-the-art systems. According to the results, it can be concluded that adding a convolution layer before the bi-directional pass improves model performance in the automatic classification and extraction of drug interactions, using pre-trained vector embeddings such as Fasttext and Bio-Bert.

References

[1]
Rita Rezaee, Reza Akbari, Milad Nasiri, Farzaneh Foroughinia, and Nasrin Shokrpour. 2018. An evaluation of classification algorithms for prediction of drug interactions: Identification of the best algorithm. International Journal of Pharmaceutical Investigation 8, 2(2018), 92–99.
[2]
Candida J Rebello, Stephen Boué, Ronald J Levy Jr, Renée Puyau, Robbie A Beyl, Frank L Greenway, Mark L Heiman, Jeffrey N Keller, Charles F Reynolds III, and John P Kirwan. 2023. Safety and Tolerability of Whole Soybean Products: A Dose-Escalating Clinical Trial in Older Adults with Obesity. Nutrients 15, 8 (2023), 1920.
[3]
Masaki Asada, Makoto Miwa, and Yutaka Sasaki. 2022. Integrating heterogeneous knowledge graphs into drug–drug interaction extraction from the literature. Bioinformatics 39, 1 (11 2022). arXiv:https://academic.oup.com/bioinformatics/article-pdf/39/1/btac754/48448763/btac754.pdf btac754.
[4]
Zhao Xiaoyan, Deng Yang, Yang Min, Wang Lingzhi, Zhang Rui, Cheng Hong, Lam Wai, Shen Ying, and Xu Ruifeng. 2023. A Comprehensive Survey on Deep Learning for Relation Extraction: Recent Advances and New Frontiers. arXiv preprint arXiv:2306.02051(2023).
[5]
Isabel Segura-Bedmar, Paloma Martínez Fernández, and María Herrero Zazo. 2013. Semeval-2013 task 9: Extraction of drug-drug interactions from biomedical texts (ddiextraction 2013). Association for Computational Linguistics.
[6]
Anass Raihani and Nabil Laachfoubi. 2016. EXTRACTING DRUG-DRUG INTERACTIONS FROM BIOMEDICAL TEXT USING A FEATURE-BASED KERNEL APPROACH.Journal of Theoretical & Applied Information Technology 92, 1(2016).
[7]
Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (2020), 1234–1240.
[8]
Remzi Celebi, Huseyin Uyar, Erkan Yasar, Ozgur Gumus, Oguz Dikenelli, and Michel Dumontier. 2019. Evaluation of knowledge graph embedding approaches for drug-drug interaction prediction in realistic settings. BMC bioinformatics 20, 1 (2019), 1–14.
[9]
Hakime Öztürk, Elif Ozkirimli, and Arzucan Özgür. 2018. A novel methodology on distributed representations of proteins using their interacting ligands. Bioinformatics 34, 13 (2018), i295–i303.
[10]
Armand Joulin, Edouard Grave, Piotr Bojanowski, and Tomas Mikolov. 2016. Bag of Tricks for Efficient Text Classification. arXiv preprint arXiv:1607.01759(2016).
[11]
P Arabie, L Hubert, G De Soete, and A Gordon. 1996. Hierarchical classification. P. Arabie, L. Hubert, G. De Soete, & A. Gordon, Clustering and classification (1996), 65–121.
[12]
Tomas Pranckevičius and Virginijus Marcinkevičius. 2016. Application of logistic regression with part-of-the-speech tagging for multi-class text classification. In 2016 IEEE 4th workshop on advances in information, electronic and electrical engineering (AIEEE). IEEE, 1–5.
[13]
Christopher D Manning, Prabhakar Raghavan, and Hinrich Schütze. 2008. Support vector machines and machine learning on documents. Introduction to Information Retrieval(2008), 319–348.
[14]
Li-Yue Bai, Hao Dai, Qin Xu, Muhammad Junaid, Shao-Liang Peng, Xiaolei Zhu, Yi Xiong, and Dong-Qing Wei. 2018. Prediction of effective drug combinations by an improved naïve bayesian algorithm. International journal of molecular sciences 19, 2(2018), 467.
[15]
Isabel Segura-Bedmar, Paloma Martinez, and Cesar de Pablo-Sánchez. 2011. Using a shallow linguistic kernel for drug–drug interaction extraction. Journal of biomedical informatics 44, 5 (2011), 789–804.
[16]
Regina Sousa, José Machado, Carla Rodrigues, and Luis Mendes Gomes. 2023. Drug-Drug Interaction Extraction-Based System: an NLP Approach. (2023).
[17]
Chengcheng Zhang, Yao Lu, and Tianyi Zang. 2022. CNN-DDI: a learning-based method for predicting drug–drug interactions using convolution neural networks. BMC bioinformatics 23, 1 (2022), 1–11.
[18]
Yijia Zhang, Wei Zheng, Hongfei Lin, Jian Wang, Zhihao Yang, and Michel Dumontier. 2018. Drug–drug interaction extraction via hierarchical RNNs on sequence and shortest dependency paths. Bioinformatics 34, 5 (2018), 828–835.
[19]
Maryam KafiKang and Abdeltawab Hendawi. 2023. Drug-Drug Interaction Extraction from Biomedical Text Using Relation BioBERT with BLSTM. Machine Learning and Knowledge Extraction 5, 2 (2023), 669–683. https://www.mdpi.com/2504-4990/5/2/36
[20]
Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (2020), 1234–1240.
[21]
Savio LY Lam and Dik Lun Lee. 1999. Feature reduction for neural network based text categorization. In Proceedings. 6th international conference on advanced systems for advanced applications. IEEE, 195–202.
[22]
Mei-Neng Wang, Yu Li, Li-Lan Lei, De-Wu Ding, and Xue-Jun Xie. 2023. Combining non-negative matrix factorization with graph Laplacian regularization for predicting drug-miRNA associations based on multi-source information fusion. Frontiers in Pharmacology 14 (2023).
[23]
Jianqing Fan, Qiang Sun, Wen-Xin Zhou, and Ziwei Zhu. 2018. Principal component analysis for big data. arXiv preprint arXiv:1801.01602(2018).
[24]
Daniel Svozil, Vladimir Kvasnicka, and Jiri Pospichal. 1997. Introduction to multi-layer feed-forward neural networks. Chemometrics and intelligent laboratory systems 39, 1 (1997), 43–62.
[25]
Phil Kim. 2017. Convolutional neural network. In MATLAB deep learning. Springer, 121–147.
[26]
Kaisheng Yao, Trevor Cohn, Katerina Vylomova, Kevin Duh, and Chris Dyer. 2015. Depth-gated LSTM. arXiv preprint arXiv:1508.03790(2015).
[27]
Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991(2015).
[28]
Sunil Kumar Sahu and Ashish Anand. 2018. Drug-drug interaction extraction from biomedical texts using long short-term memory network. Journal of biomedical informatics 86 (2018), 15–24.
[29]
Mohammad Hossin and Md Nasir Sulaiman. 2015. A review on evaluation metrics for data classification evaluations. International journal of data mining & knowledge management process 5, 2(2015), 1.
[30]
Guillaume Lemaître, Fernando Nogueira, and Christos K Aridas. 2017. Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning. The Journal of Machine Learning Research 18, 1 (2017), 559–563.

Index Terms

  1. DeepMedFeature: An Accurate Feature Extraction and Drug-Drug Interaction Model for Clinical Text in Medical Informatics

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing Just Accepted
      EISSN:2375-4702
      Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Online AM: 09 March 2024
      Accepted: 02 February 2024
      Revised: 07 November 2023
      Received: 27 June 2023

      Check for updates

      Author Tags

      1. Drug Drug Interactions
      2. Hierarchical Classification
      3. Deep Learning
      4. Convolutional Bi-LSTM
      5. Pre-trained Vector Embeddings

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 229
        Total Downloads
      • Downloads (Last 12 months)229
      • Downloads (Last 6 weeks)43
      Reflects downloads up to 23 Jan 2025

      Other Metrics

      Citations

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Full Access

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media