MutationFinder: a high-performance system for extracting point mutation mentions from text

Bioinformatics. 2007 Jul 15;23(14):1862-5. doi: 10.1093/bioinformatics/btm235. Epub 2007 May 11.

Abstract

Discussion of point mutations is ubiquitous in biomedical literature, and manually compiling databases or literature on mutations in specific genes or proteins is tedious. We present an open-source, rule-based system, MutationFinder, for extracting point mutation mentions from text. On blind test data, it achieves nearly perfect precision and a markedly improved recall over a baseline.

Availability: MutationFinder, along with a high-quality gold standard data set, and a scoring script for mutation extraction systems have been made publicly available. Implementations, source code and unit tests are available in Python, Perl and Java. MutationFinder can be used as a stand-alone script, or imported by other applications.

Project url: http://bionlp.sourceforge.net.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • DNA Mutational Analysis
  • Databases, Bibliographic
  • Databases, Genetic
  • Databases, Protein
  • Genetic Techniques
  • Humans
  • Mutation*
  • Pattern Recognition, Automated
  • Point Mutation*
  • Publications
  • Reproducibility of Results
  • Software