research-article

Free access

Just Accepted

DeepMedFeature: An Accurate Feature Extraction and Drug-Drug Interaction Model for Clinical Text in Medical Informatics

Authors:

M. Shoaib Malik,

Syed Atif Moqurrab,

Gautam SrivastavaAuthors Info & Claims

ACM Transactions on Asian and Low-Resource Language Information Processing

Accepted on 02 February 2024

https://doi.org/10.1145/3651159

Online AM: 09 March 2024 Publication History

Abstract

Drug-drug interactions (DDIs) are an important biological phenomenon which can result in medical errors from medical practitioners. Drug interactions can change the molecular structure of interacting agents which may prove to be fatal in the worst case. Finding drug interactions early in diagnosis can be pivotal in side-effect prevention. The growth of big data provides a rich source of information for clinical studies to investigate DDIs. We propose a hierarchical classification model which is double-pass in nature. The first pass predicts the occurrence of an interaction and then the second pass further predicts the type of interaction such as effect, advice, mechanism, and int. We applied different deep learning algorithms with Convolutional Bi-LSTM (ConvBLSTM) proving to be the best. The results show that pre-trained vector embeddings prove to be the most appropriate features. The F1-score of the ConvBLSTM algorithm turned out to be 96.39% and 98.37% in Russian and English language respectively which is greater than the state-of-the-art systems. According to the results, it can be concluded that adding a convolution layer before the bi-directional pass improves model performance in the automatic classification and extraction of drug interactions, using pre-trained vector embeddings such as Fasttext and Bio-Bert.

References

[1]

Rita Rezaee, Reza Akbari, Milad Nasiri, Farzaneh Foroughinia, and Nasrin Shokrpour. 2018. An evaluation of classification algorithms for prediction of drug interactions: Identification of the best algorithm. International Journal of Pharmaceutical Investigation 8, 2(2018), 92–99.

[2]

Candida J Rebello, Stephen Boué, Ronald J Levy Jr, Renée Puyau, Robbie A Beyl, Frank L Greenway, Mark L Heiman, Jeffrey N Keller, Charles F Reynolds III, and John P Kirwan. 2023. Safety and Tolerability of Whole Soybean Products: A Dose-Escalating Clinical Trial in Older Adults with Obesity. Nutrients 15, 8 (2023), 1920.

[3]

Masaki Asada, Makoto Miwa, and Yutaka Sasaki. 2022. Integrating heterogeneous knowledge graphs into drug–drug interaction extraction from the literature. Bioinformatics 39, 1 (11 2022). arXiv:https://academic.oup.com/bioinformatics/article-pdf/39/1/btac754/48448763/btac754.pdf btac754.

[4]

Zhao Xiaoyan, Deng Yang, Yang Min, Wang Lingzhi, Zhang Rui, Cheng Hong, Lam Wai, Shen Ying, and Xu Ruifeng. 2023. A Comprehensive Survey on Deep Learning for Relation Extraction: Recent Advances and New Frontiers. arXiv preprint arXiv:2306.02051(2023).

[5]

Isabel Segura-Bedmar, Paloma Martínez Fernández, and María Herrero Zazo. 2013. Semeval-2013 task 9: Extraction of drug-drug interactions from biomedical texts (ddiextraction 2013). Association for Computational Linguistics.

[6]

Anass Raihani and Nabil Laachfoubi. 2016. EXTRACTING DRUG-DRUG INTERACTIONS FROM BIOMEDICAL TEXT USING A FEATURE-BASED KERNEL APPROACH.Journal of Theoretical & Applied Information Technology 92, 1(2016).

[7]

Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (2020), 1234–1240.

[8]

Remzi Celebi, Huseyin Uyar, Erkan Yasar, Ozgur Gumus, Oguz Dikenelli, and Michel Dumontier. 2019. Evaluation of knowledge graph embedding approaches for drug-drug interaction prediction in realistic settings. BMC bioinformatics 20, 1 (2019), 1–14.

[9]

Hakime Öztürk, Elif Ozkirimli, and Arzucan Özgür. 2018. A novel methodology on distributed representations of proteins using their interacting ligands. Bioinformatics 34, 13 (2018), i295–i303.

[10]

Armand Joulin, Edouard Grave, Piotr Bojanowski, and Tomas Mikolov. 2016. Bag of Tricks for Efficient Text Classification. arXiv preprint arXiv:1607.01759(2016).

[11]

P Arabie, L Hubert, G De Soete, and A Gordon. 1996. Hierarchical classification. P. Arabie, L. Hubert, G. De Soete, & A. Gordon, Clustering and classification (1996), 65–121.

[12]

Tomas Pranckevičius and Virginijus Marcinkevičius. 2016. Application of logistic regression with part-of-the-speech tagging for multi-class text classification. In 2016 IEEE 4th workshop on advances in information, electronic and electrical engineering (AIEEE). IEEE, 1–5.

[13]

Christopher D Manning, Prabhakar Raghavan, and Hinrich Schütze. 2008. Support vector machines and machine learning on documents. Introduction to Information Retrieval(2008), 319–348.

[14]

Li-Yue Bai, Hao Dai, Qin Xu, Muhammad Junaid, Shao-Liang Peng, Xiaolei Zhu, Yi Xiong, and Dong-Qing Wei. 2018. Prediction of effective drug combinations by an improved naïve bayesian algorithm. International journal of molecular sciences 19, 2(2018), 467.

[15]

Isabel Segura-Bedmar, Paloma Martinez, and Cesar de Pablo-Sánchez. 2011. Using a shallow linguistic kernel for drug–drug interaction extraction. Journal of biomedical informatics 44, 5 (2011), 789–804.

Digital Library

[16]

Regina Sousa, José Machado, Carla Rodrigues, and Luis Mendes Gomes. 2023. Drug-Drug Interaction Extraction-Based System: an NLP Approach. (2023).

[17]

Chengcheng Zhang, Yao Lu, and Tianyi Zang. 2022. CNN-DDI: a learning-based method for predicting drug–drug interactions using convolution neural networks. BMC bioinformatics 23, 1 (2022), 1–11.

[18]

Yijia Zhang, Wei Zheng, Hongfei Lin, Jian Wang, Zhihao Yang, and Michel Dumontier. 2018. Drug–drug interaction extraction via hierarchical RNNs on sequence and shortest dependency paths. Bioinformatics 34, 5 (2018), 828–835.

[19]

Maryam KafiKang and Abdeltawab Hendawi. 2023. Drug-Drug Interaction Extraction from Biomedical Text Using Relation BioBERT with BLSTM. Machine Learning and Knowledge Extraction 5, 2 (2023), 669–683. https://www.mdpi.com/2504-4990/5/2/36

[20]

Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (2020), 1234–1240.

[21]

Savio LY Lam and Dik Lun Lee. 1999. Feature reduction for neural network based text categorization. In Proceedings. 6th international conference on advanced systems for advanced applications. IEEE, 195–202.

[22]

Mei-Neng Wang, Yu Li, Li-Lan Lei, De-Wu Ding, and Xue-Jun Xie. 2023. Combining non-negative matrix factorization with graph Laplacian regularization for predicting drug-miRNA associations based on multi-source information fusion. Frontiers in Pharmacology 14 (2023).

[23]

Jianqing Fan, Qiang Sun, Wen-Xin Zhou, and Ziwei Zhu. 2018. Principal component analysis for big data. arXiv preprint arXiv:1801.01602(2018).

[24]

Daniel Svozil, Vladimir Kvasnicka, and Jiri Pospichal. 1997. Introduction to multi-layer feed-forward neural networks. Chemometrics and intelligent laboratory systems 39, 1 (1997), 43–62.

[25]

Phil Kim. 2017. Convolutional neural network. In MATLAB deep learning. Springer, 121–147.

[26]

Kaisheng Yao, Trevor Cohn, Katerina Vylomova, Kevin Duh, and Chris Dyer. 2015. Depth-gated LSTM. arXiv preprint arXiv:1508.03790(2015).

[27]

Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991(2015).

[28]

Sunil Kumar Sahu and Ashish Anand. 2018. Drug-drug interaction extraction from biomedical texts using long short-term memory network. Journal of biomedical informatics 86 (2018), 15–24.

[29]

Mohammad Hossin and Md Nasir Sulaiman. 2015. A review on evaluation metrics for data classification evaluations. International journal of data mining & knowledge management process 5, 2(2015), 1.

[30]

Guillaume Lemaître, Fernando Nogueira, and Christos K Aridas. 2017. Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning. The Journal of Machine Learning Research 18, 1 (2017), 559–563.

Digital Library

Index Terms

DeepMedFeature: An Accurate Feature Extraction and Drug-Drug Interaction Model for Clinical Text in Medical Informatics
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
  2. Machine learning

Recommendations

Drug–Drug Interaction Relation Extraction Based on Deep Learning: A Review
Drug–drug interaction (DDI) is an important part of drug development and pharmacovigilance. At the same time, DDI is an important factor in treatment planning, monitoring effects of medicine and patient safety, and has a significant impact on public ...
Drug Repurposing: Targeting mTOR Inhibitors for Anticancer Activity
CSBio '17: Proceedings of the 8th International Conference on Computational Systems-Biology and Bioinformatics

In the search of safer and more effective drugs while reducing costs and increasing productivity of novel drug discovery, scientists are changing their focus to an approach known as drug repurposing. This involves finding a new therapeutic effect of an ...
Deep Learning for High-Order Drug-Drug Interaction Prediction
BCB '19: Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics

Drug-drug interactions (DDIs) and their associated adverse drug reactions (ADRs) represent a significant detriment to the public health. Existing research on DDIs is primarily focused on pairwise DDI detection and prediction. It is highly needed to ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Asian and Low-Resource Language Information Processing

ACM Transactions on Asian and Low-Resource Language Information Processing Just Accepted

EISSN:2375-4702

Table of Contents

Copyright © 2024 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Online AM: 09 March 2024

Accepted: 02 February 2024

Revised: 07 November 2023

Received: 27 June 2023

Check for updates

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
229
Total Downloads

Downloads (Last 12 months)229
Downloads (Last 6 weeks)43

Reflects downloads up to 23 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Media

Figures

Other

Tables