DOI: 10.1145/2487575.2488214

Empirical Bayes model to combine signals of adverse drug reactions

Published: 11 August 2013

Abstract

Data mining is a crucial tool for identifying risk signals of potential adverse drug reactions (ADRs). However, mining of ADR signals is currently limited to leveraging a single data source at a time. It is widely believed that combining ADR evidence from multiple data sources will result in a more accurate risk identification system. We present a methodology based on empirical Bayes modeling to combine ADR signals mined from ~5 million adverse event reports collected by the FDA and from healthcare data corresponding to 46 million patients, the two main types of information sources currently employed for signal detection. Based on four sets of test cases (gold standard), we demonstrate that our method leads to a statistically significant and substantial improvement in signal detection accuracy, averaging 40% over the use of each source independently, and an area under the ROC curve of 0.87. We also compare the method with alternative supervised learning approaches, and argue that our approach is preferable because it does not require labeled (training) samples, whose availability is currently limited. To our knowledge, this is the first effort to combine signals from these two complementary data sources, and to demonstrate the benefits of a computationally integrative strategy for drug safety surveillance.
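The abstract does not spell out the model, so the following is only a minimal sketch of the general idea of empirical Bayes signal combination, not the authors' method. It assumes each drug-event pair has one effect estimate (for example a log relative risk) and a standard error from each of the two sources, pools them by inverse-variance weighting, shrinks the pooled values toward a prior estimated from the data (a normal-normal model fitted by the method of moments), and scores a ranking with a rank-based AUC. All function names and the choice of model are illustrative assumptions.

```python
from statistics import fmean


def combine_sources(y1, s1, y2, s2):
    """Inverse-variance pooling of two estimates for one drug-event pair.

    y1, y2 are effect estimates from the two sources; s1, s2 are their
    (positive) standard errors. Hypothetical helper, not from the paper.
    """
    w1, w2 = 1.0 / s1 ** 2, 1.0 / s2 ** 2
    pooled = (w1 * y1 + w2 * y2) / (w1 + w2)
    variance = 1.0 / (w1 + w2)
    return pooled, variance


def empirical_bayes_shrink(estimates, variances):
    """Normal-normal empirical Bayes shrinkage (assumed model).

    The prior mean and variance are estimated from the data by the
    method of moments; each estimate is then replaced by its posterior
    mean, so noisier estimates are pulled harder toward the prior mean.
    """
    n = len(estimates)
    mu = fmean(estimates)
    total_var = sum((y - mu) ** 2 for y in estimates) / (n - 1)
    # Between-pair variance: observed spread minus average sampling noise.
    tau2 = max(total_var - fmean(variances), 0.0)
    shrunk = []
    for y, v in zip(estimates, variances):
        weight = tau2 / (tau2 + v) if (tau2 + v) > 0 else 0.0
        shrunk.append(mu + weight * (y - mu))
    return shrunk


def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count 0.5).

    labels use 1 for gold-standard positives and 0 for negatives.
    """
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Toy, made-up inputs purely to exercise the functions.
    source_a = [(1.2, 0.4), (0.1, 0.5), (0.9, 0.3), (-0.2, 0.6)]
    source_b = [(0.8, 0.5), (0.3, 0.4), (1.1, 0.6), (0.0, 0.5)]
    gold = [1, 0, 1, 0]
    pooled = [combine_sources(ya, sa, yb, sb)
              for (ya, sa), (yb, sb) in zip(source_a, source_b)]
    scores = empirical_bayes_shrink([p for p, _ in pooled],
                                    [v for _, v in pooled])
    print("combined AUC on toy data:", auc(scores, gold))
```

A design note on the sketch: because the prior is estimated from the pooled estimates themselves, no labeled training cases are needed to produce the scores; the gold-standard labels are used only for evaluation, which mirrors the advantage the abstract claims over supervised approaches.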


Published In

KDD '13: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
August 2013
1534 pages
ISBN: 9781450321747
DOI: 10.1145/2487575

Publisher

Association for Computing Machinery, New York, NY, United States

Author Tags

1. empirical Bayes
2. pharmacovigilance
3. signal detection

Qualifiers

• Research-article

Conference

KDD '13

Acceptance Rates

KDD '13 paper acceptance rate: 125 of 726 submissions, 17%
Overall acceptance rate: 1,133 of 8,635 submissions, 13%
