skip to main content
survey
Open access

Modeling Information Retrieval by Formal Logic: A Survey

Published: 21 February 2019 Publication History

Abstract

Several mathematical frameworks have been used to model the information retrieval (IR) process, among them, formal logics. Logic-based IR models upgrade the IR process from document-query comparison to an inference process, in which both documents and queries are expressed as sentences of the selected formal logic. The underlying formal logic also permits one to represent and integrate knowledge in the IR process. One of the main obstacles that has prevented the adoption and large-scale diffusion of logic-based IR systems is their complexity. However, several logic-based IR models have been recently proposed that are applicable to large-scale data collections. In this survey, we present an overview of the most prominent logical IR models that have been proposed in the literature. The considered logical models are categorized under different axes, which include the considered logics and the way in which uncertainty has been modeled, for example, degrees of belief or degrees of truth. Accordingly, the main contribution of the article is to categorize the state-of-the-art logical models on a fine-grained basis, and for the considered models the related implementation aspects are described. Consequently, the proposed survey is finalized to better understand and compare the different logical IR models. Last, but not least, this article aims at reconsidering the potentials of logical approaches to IR by outlining the advances of logic-based approaches in close research areas.

References

[1]
Karam Abdulahhad. 2014. Information Retrieval Modeling by Logic and Lattice. Application to Conceptual Information Retrieval. Ph.D. Dissertation. Ecole Doctorale Mathématiques, Sciences et Technologies de l’Information, Informatique, Grenoble, France.
[2]
Karam Abdulahhad, Jean-Pierre Chevallet, and Catherine Berrut. 2013. Is uncertain logical-matching equivalent to conditional probability? In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, Dublin, Ireland, 825--828.
[3]
Karam Abdulahhad, Jean-Pierre Chevallet, and Catherine Berrut. 2017. Logics, lattices and probability: The missing links to information retrieval. Comput. J. 60, 7 (2017), 995--1018. arXiv:/oup/backfile/content_public/journal/comjnl/60/7/10.1093_comjnl_bxw034/1/bxw034.pdf
[4]
Gianni Amati and Stephan S. Kerpedjiev. 1992. An information retrieval logic model: Implementation and experiments. IEEE Transactions on Reliability 48 (1992).
[5]
Gianni Amati and Cornelis Joost Van Rijsbergen. 2002. Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Trans. Inf. Syst. 20, 4 (Oct. 2002), 357--389.
[6]
Horacio Arlo-Costa and Paul Egré. 2016. The logic of conditionals. In The Stanford Encyclopedia of Philosophy (Winter 2016 ed.), Edward N. Zalta (Ed.). Metaphysics Research Lab, Stanford University.
[7]
Franz Baader, Diego Calvanese, Deborah L. McGuinness, Daniele Nardi, and Peter F. Patel-Schneider (Eds.). 2003. The Description Logic Handbook: Theory, Implementation, and Applications. Cambridge University Press, New York, NY.
[8]
Ricardo A. Baeza-Yates and Berthier Ribeiro-Neto. 1999. Modern Information Retrieval. Addison-Wesley Longman Publishing Co., Inc., Boston, MA.
[9]
J. Barwise. 1989. The Situation in Logic. Center for the Study of Language and Information, Stanford, CA. http://books.google.fr/books?id=aX7RKgvpJw8C.
[10]
J. Barwise and J. Perry. 1983. Situations and Attitudes. MIT Press, Cambridge, MA. http://books.google.fr/books?id=DCTXAAAAMAAJ.
[11]
Luigi Bellomarini, Georg Gottlob, Andreas Pieris, and Emanuel Sallinger. 2017. Swift logic for big data and knowledge graphs. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI’17). AAAI Press, 2--10.
[12]
Patrick Bosc, Vincent Claveau, Olivier Pivert, and Laurent Ughetto. 2009. Graded-Inclusion-Based Information Retrieval Systems. In Proceedings of Advances in Information Retrieval: 31st European Conference on IR Research (ECIR’09), Toulouse, France, April 6-9, 2009.Springer, Berlin, 252--263.
[13]
K. Boukhari and M. N. Omri. 2017. Information retrieval based on description logic: Application to biomedical documents. In 2017 International Conference on High Performance Computing Simulation (HPCS’17). 846--853.
[14]
Pablo Castells, Miriam Fernandez, and David Vallet. 2007. An adaptation of the vector-space model for ontology-based information retrieval. IEEE Trans. on Knowl. Data Eng. 19, 2 (Feb. 2007), 261--272.
[15]
Michel Chein and Marie-laure Mugnier. 1992. Conceptual graphs: Fundamental notions. Revue d’Intelligence Artificielle 6 (1992), 365--406. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.37.848.
[16]
Jean-Pierre Chevallet and Yves Chiaramella. 1995. Extending a logic-based retrieval model with algebraic knowledge. In MIRO Multimedia Information Retrieval, Final Workshop. 9 pages. https://hal.inria.fr/hal-00953974.
[17]
Jean-Pierre Chevallet and Yves Chiaramella. 1998. Experiences in information retrieval modelling using structured formalisms and modal logic. In Information Retrieval: Uncertainty and Logics, Fabio Crestani, Mounia Lalmas, and CornelisJoost Rijsbergen (Eds.). Springer US, New York, 39--72.
[18]
Stéphane Clinchant and Eric Gaussier. 2010. Information-based models for ad hoc IR. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, Geneva, Switzerland, 234--241.
[19]
W. S. Cooper. 1978. Foundations of Logico-Linguistics: A Unified Theory of Information, Language, and Logic. Springer, the Netherlands. https://books.google.it/books?id=QZ0tLAwn0yMC.
[20]
R. T. Cox. 1946. Probability, frequency and reasonable expectation. American Journal of Physics 14, 1 (1946), 1--13.
[21]
Fabio Crestani. 1998. Logical imaging and probabilistic information retrieval. In Information Retrieval: Uncertainty and Logics, Fabio Crestani, Mounia Lalmas, and CornelisJoost Rijsbergen (Eds.). The Kluwer International Series on Information Retrieval, Vol. 4. Springer US, 247--279.
[22]
Fabio Crestani and Mounia Lalmas. 2001. Logic and uncertainty in information retrieval. In Lectures on Information Retrieval, Maristella Agosti, Fabio Crestani, and Gabriella Pasi (Eds.). Springer Berlin Heidelberg.
[23]
Fabio Crestani and C. J. Van Rijsbergen. 1995. Information retrieval by logical imaging. Journal of Documentation 51 (1995), 3--17.
[24]
Fabio Crestani, Ian Ruthven, Marc Sanderson, and C. J. van Rijsbergen. 1995. The troubles with using a logical model of IR on a large collection of documents. In Proceedings of the Fourth Text Retrieval Conference (TREC-4'95), D. K. Harman (Eds.). 509--526.
[25]
Fabio Crestani and C. J. van Rijsbergen. 1995. Probability kinematics in information retrieval. In Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’95). ACM, New York, NY, 291--299.
[26]
Mukesh Dalal. 1988. Investigations into a theory of knowledge base revision: Preliminary report. In Proceedings of the 7th National Conference on Artificial Intelligence, Paul Rosenbloom and Peter Szolovits (Eds.), Vol. 2. AAAI Press, Menlo Park, CA, 475--479.
[27]
Sandor Dominich. 2008. The Modern Algebra of Information Retrieval. Springer, Berlin. http://books.google.fr/books?id=uEedNKV3nlUC.
[28]
Didier Dubois, Walenty Ostasiewicz, and Henri Prade. 2000. Fuzzy Sets: History and Basic Notions. Springer US, Boston, MA, 21--124.
[29]
Didier Dubois and Henri Prade. 2001. Possibility theory, probability theory and multiple-valued logics: A clarification. Annals of Mathematics and Artificial Intelligence 32, 1--4 (Aug. 2001), 35--66.
[30]
Dorothy Edgington. 2014. Indicative conditionals. In The Stanford Encyclopedia of Philosophy (Winter 2014 ed.), Edward N. Zalta (Ed.). Metaphysics Research Lab, Stanford University. https://plato.stanford.edu/archives/win2014/entries/conditionals/.
[31]
Peter Exner and Pierre Nugues. 2012. Entity extraction: From unstructured text to DBpedia RDF triples. In Proceedings of the Web of Linked Entities Workshop in Conjunction with the 11th International Semantic Web Conference (ISWC’12). CEUR, 58--69.
[32]
Miriam Fernández, Iván Cantador, Vanesa López, David Vallet, Pablo Castells, and Enrico Motta. 2011. Semantically enhanced information retrieval: An ontology-based approach. Web Semant. 9, 4 (Dec. 2011), 434--452.
[33]
Thomas J. Froehlich. 1994. Relevance reconsidered—Towards an agenda for the 21st century: Introduction to special topic issue on relevance research. J. Am. Soc. Inf. Sci. 45, 3 (April 1994), 124--134.
[34]
Norbert Fuhr. 1995. Probabilistic datalog - A logic for powerful retrieval methods. In Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, Seattle, WA, 282--290.
[35]
Norbert Fuhr. 2000. Probabilistic datalog: Implementing logical information retrieval for advanced applications. JASIS 51, 2 (2000), 95--110.
[36]
Norbert Fuhr and Thomas Rölleke. 1998. HySpirit — A probabilistic inference engine for hypermedia retrieval in large databases. In Advances in database technology — EDBT’98, Hans-Jörg Schek, Gustavo Alonso, Felix Saltor, and Isidro Ramos (Eds.). Springer, Berlin, 24--38.
[37]
Peter Gardenfors. 1982. Imaging and conditionalization. The Journal of Philosophy 79, 12 (1982), 747--760.
[38]
Jiafeng Guo, Gu Xu, Xueqi Cheng, and Hang Li. 2009. Named entity recognition in query. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’09). ACM, New York, NY, 267--274.
[39]
Yogesh Gupta, Ashish Saini, and Ak Saxena. 2014. Fuzzy logic-based approach to develop hybrid similarity measure for efficient information retrieval. J. Inf. Sci. 40, 6 (Dec. 2014), 846--857.
[40]
R. Haenni, J. Kohlas, and N. Lehmann. 2000. Probabilistic Argumentation Systems. Springer Netherlands, Dordrecht, 221--288.
[41]
Theodore Hailperin. 1984. Probability logic. Notre Dame J. Formal Logic 25, 3 (07 1984), 198--212.
[42]
Alan Hájek. 2001. Probability, logic, and probability logic. The Blackwell Guide to Philosophical Logic, Blackwell (2001), 362--384.
[43]
Faegheh Hasibi, Fedor Nikolaev, Chenyan Xiong, Krisztian Balog, Svein Erik Bratsberg, Alexander Kotov, and Jamie Callan. 2017. DBpedia-entity V2: A test collection for entity search. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’17). ACM, New York, NY, 1265--1268.
[44]
G. E. Hughes and J. Cresswell. 1996. A New Introduction to Modal Logic. Routledge. https://books.google.fr/books?id=Dsn1xWNB4MEC.
[45]
T. W. C. Huibers and P. D. Bruza. 1994. Situations: A General Framework for Studying Information Retrieval. Utrecht University, Department of Computer Science. http://books.google.fr/books?id=YtcuGwAACAAJ.
[46]
Anthony Hunter. 1995. Using default logic in information retrieval. In Symbolic and Quantitative Approaches to Reasoning and Uncertainty, Christine Froidevaux and Jürg Kohlas (Eds.). Springer Berlin Heidelberg, Springer, Berlin, 235--242.
[47]
Kevin H. Knuth. 2004. Deriving laws from ordering relations. AIP Conference Proceedings 707, 1 (2004), 204--235.
[48]
Kevin H. Knuth. 2005. Lattice duality: The origin of probability and entropy. Neurocomput. 67 (Aug. 2005), 245--274.
[49]
Daphne Koller, Alon Levy, and Avi Pfeffer. 1997. P-CLASSIC: A tractable probabilistic description logic. In Proceedings of AAAI’97. AAAI Press, Menlo Park, CA, Providence, RI, 390--397.
[50]
S. A. Kripke. 1963. Semantic analysis of modal logic I: Normal modal and propositional calculi. Zeitschrift für Mathematische Logik und Grundlagen der Mathematik 9 (1963), 67--96.
[51]
Mounia Lalmas. 1998. Logical models in information retrieval: Introduction and overview. In Information Processing 8 Management. 34, 1 (1998), 19--33.
[52]
Mounia Lalmas and Peter D. Bruza. 1998. The use of logic in information retrieval modelling. The Knowledge Engineering Review 13, 3 (10 1998), 263--295.
[53]
Mounia Lalmas and Keith Rijsbergen. 1993. A logical model of information retrieval based on situation theory. In 14th Information Retrieval Colloquium, Tony McEnery and Chris Paice (Eds.). Springer London, 1--13.
[54]
Raymond Y. K. Lau, Peter D. Bruza, and Dawei Song. 2008. Towards a belief-revision-based adaptive and context-sensitive information retrieval system. ACM Trans. Inf. Syst. 26, 2, Article 8 (April 2008), 38 pages.
[55]
David Lewis. 1976. Probabilities of conditionals and conditional probabilities. The Philosophical Review 85, 3 (1976), 297--315. http://www.jstor.org/stable/2184045.
[56]
David K. Lewis. 1973. Counterfactuals. Harvard University Press, Cambridge, MA. http://www.worldcat.org/oclc/795075.
[57]
Christina Lioma, Roi Blanco, Raquel Mochales Palau, and Marie-Francine Moens. 2009. A belief model of query difficulty that uses subjective logic. In Advances in Information Retrieval Theory, Leif Azzopardi, Gabriella Kazai, Stephen Robertson, Stefan Rüger, Milad Shokouhi, Dawei Song, and Emine Yilmaz (Eds.). Springer, Berlin, 92--103.
[58]
Christina Lioma, Birger Larsen, Hinrich Schuetze, and Peter Ingwersen. 2010. A subjective logic formalisation of the principle of polyrepresentation for information needs. In Proceedings of the 3rd Symposium on Information Interaction in Context (IIiX’10). ACM, New York, NY, 125--134.
[59]
David E. Losada and Alvaro Barreiro. 2001. A logical model for information retrieval based on propositional logic and belief revision. Comput. J. 44, 5 (2001), 410--424.
[60]
David E. Losada and Alvaro Barreiro. 2003. Propositional logic representations for documents and queries: A large-scale evaluation. In Proceedings of the 25th European Conference on IR Research (ECIR’03). Springer, Berlin, 219--234. http://dl.acm.org/citation.cfm?id=1757788.1757810.
[61]
Thomas Lukasiewicz. 2008. Expressive probabilistic description logics. Artif. Intell. 172, 6-7 (April 2008), 852--883.
[62]
Loic Maisonnasse, Eric Gaussier, and Jean-Pierre Chevallet. 2009. Model fusion in conceptual language modeling. In Proceedings of the 31st European Conference on IR Research on Advances in Information Retrieval (ECIR’09). Springer, Berlin, 240--251.
[63]
Carlo Meghini, Fabrizio Sebastiani, Umberto Straccia, and Costantino Thanos. 1993. A model of information retrieval based on a terminological logic. In Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, Pittsburgh, PA, 298--307.
[64]
Carlo Meghini and Umberto Straccia. 1996. A relevance terminological logic for information retrieval. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’96). ACM, New York, NY, 197--205.
[65]
Stefano Mizzaro. 1997. Relevance: The whole history. Journal of the American Society for Information Science 48, 9 (1997), 810--832.
[66]
B. Nebel. 1992. Syntax-based approaches to belief revision. In Belief Revision. Cambridge University Press, New York, NY, 52--88.
[67]
J. Nie. 1988. An outline of a general model for information retrieval systems. In Proceedings of the 11th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, Grenoble, France, 495--506.
[68]
Jianyun Nie. 1989. An information retrieval model based on modal logic. Information Processing 8 Management 25, 5 (1989), 477--491.
[69]
Jian-Yun Nie. 1992. Towards a probabilistic modal logic for semantic-based information retrieval. In Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, Copenhagen, Denmark, 140--151.
[70]
Jian-Yun Nie and Martin Brisebois. 1996. An inferential approach to information retrieval and its implementation using a manual thesaurus. Artif. Intell. Rev. 10, 5--6 (Oct. 1996), 409--439.
[71]
Jian-Yun Nie and Francois Lepage. 1998. Toward a broader logical model for information retrieval. In Information Retrieval: Uncertainty and Logics, Fabio Crestani, Mounia Lalmas, and CornelisJoost van Rijsbergen (Eds.). The Kluwer International Series on Information Retrieval, Vol. 4. Springer US, 17--38.
[72]
Gabriella Pasi. 1999. A logical formulation of the Boolean model and of weighted Boolean models. In Proceedings of the Workshop on Logical and Uncertainty Models for Information Systems, LUMIS, at ECSQARU’99. Éditions Universitaires d’Avignon, 1--11. http://dblp.uni-trier.de/db/conf/coria/coria2011.html#AbdulahhadCB11.
[73]
Paolo Penna. 2000. Succinct representations of model based belief revision. In STACS 2000, Horst Reichel and Sophie Tison (Eds.). Lecture Notes in Computer Science, Vol. 1770. Springer, Berlin, 205--216.
[74]
Justin Picard and Jacques Savoy. 2000. A logical information retrieval model based on a combination of propositional logic and probability theory. In Soft Computing in Information Retrieval, Fabio Crestani and Gabriella Pasi (Eds.). Physica-Verlag HD, Heidelberg.
[75]
Jay M. Ponte and W. Bruce Croft. 1998. A language modeling approach to information retrieval. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, Melbourne, Australia, 275--281.
[76]
Guilin Qi and Jeff Z. Pan. 2008. A tableau algorithm for possibilistic description logic ALC. In Proceedings of the 3rd Asian Semantic Web Conference on The Semantic Web (ASWC’08). Springer, Berlin, 61--75.
[77]
Saïd Radhouani, Gilles Falquet, and Jean-Pierre Chevalletinst. 2008. Description logic to model a domain specific information retrieval system. In Database and Expert Systems Applications, Sourav S. Bhowmick, Josef Küng, and Roland Wagner (Eds.). Springer, Berlin, 142--149.
[78]
S. E. Robertson and K. S. Jones. 1976. Relevance weighting of search terms. J. Am. Soc. Inf. Sci. 27, 3 (1976), 129--146.
[79]
S. E. Robertson and S. Walker. 1994. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Springer, New York, NY, Dublin, Ireland, 232--241. http://dl.acm.org/citation.cfm?id=188490.188561
[80]
Thomas Rölleke and Norbert Fuhr. 1996. Retrieval of complex objects using a four-valued logic. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, Zurich, Switzerland, 206--214.
[81]
Thomas Rölleke, Hengzhi Wu, Jun Wang, and Hany Azzam. 2008. Modelling retrieval models in a probabilistic relational algebra with a new operator: The relational Bayes. VLDB J. 17, 1 (2008), 5--37.
[82]
Neil Rubens. 2006. The application of fuzzy logic to the construction of the ranking function of information retrieval systems. CoRR abs/cs/0610039 (2006). arxiv:cs/0610039 http://arxiv.org/abs/cs/0610039.
[83]
G. Salton, A. Wong, and C. S. Yang. 1975. A vector space model for automatic indexing. Commun. ACM 18, 11 (Nov. 1975), 613--620.
[84]
Christoph Schwering. 2017. A reasoning system for a first-order logic of limited belief. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI’17). AAAI Press, 1247--1253.
[85]
Fabrizio Sebastiani. 1994. A probabilistic terminological logic for modelling information retrieval. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Springer, New York, NY, Dublin, Ireland, 122--130. http://portal.acm.org/citation.cfm?id=188490.188544.
[86]
Fabrizio Sebastiani. 1998. On the role of logic in information retrieval. Information Processing 8 Management 34, 1 (1998), 1--18.
[87]
Fabrizio Sebastiani. 1999. Towards a logical reconstruction of information retrieval theory. Cybernet. Syst 30 (1999), 411--428.
[88]
Fabrizio Sebastiani and Umberto Straccia. 1991. A computationally tractable terminological logic. In SCAI. 307--315.
[89]
Amit Singhal, Chris Buckley, and Mandar Mitra. 1996. Pivoted document length normalization. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, Zurich, Switzerland, 21--29.
[90]
John F. Sowa. 2010. The Role of Logic and Ontology in Language and Reasoning. Springer Netherlands, Dordrecht, 231--263.
[91]
Laurent Ughetto, Gabriella Pasi, Vincent Claveau, Olivier Pivert, and Patrick Bosc. 2010. Implication in information retrieval systems. In Adaptivity, Personalization and Fusion of Heterogeneous Information (RIAO’10). Le Centre de Hautes Etudes Internationales d’Informatique Documentaire, Paris, France, 61--64. http://dl.acm.org/citation.cfm?id=1937055.1937068.
[92]
David Vallet, Miriam Fernández, and Pablo Castells. 2005. An ontology-based information retrieval model. In The Semantic Web: Research and Applications, Asunción Gómez-Párez and Jéróme Euzenat (Eds.). Lecture Notes in Computer Science, Vol. 3532. Springer, Berlin, 455--470.
[93]
C. J. van Rijsbergen. 1977. A theoretical basis for the use of co-occurrence data in information retrieval. Journal of Documentation 33 (June 1977), 106--119. http://libra.msra.cn/paperdetail.aspx?id=1292204.
[94]
C. J. van Rijsbergen. 1986. A non-classical logic for information retrieval. Comput. J. 29, 6 (1986), 481--485. http://dblp.uni-trier.de/db/journals/cj/cj29.html#Rijsbergen86.
[95]
Yunjie Xu and Hainan Yin. 2008. Novelty and topicality in interactive information retrieval. J. Am. Soc. Inf. Sci. Technol. 59, 2 (Jan. 2008), 201--215.
[96]
Yunjie (Calvin) Xu and Zhiwei Chen. 2006. Relevance judgment: What do information users consider beyond topicality? J. Am. Soc. Inf. Sci. Technol. 57, 7 (May 2006), 961--973.
[97]
L. A. Zadeh. 1965. Fuzzy sets. Information and Control 8, 3 (1965), 338--353.
[98]
Loutfi Zerarga and Yassine Djouadi. 2018. A many-sorted theory proposal for information retrieval: Axiomatization and semantics. Knowledge and Information Systems 55, 1 (01 Apr 2018), 113--139.
[99]
Guido Zuccon, Leif Azzopardi, and Cornelis J. van Rijsbergen. 2009. Revisiting logical imaging for information retrieval. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’09). ACM, New York, NY, 766--767.
[100]
G. Zuccon, L. A. Azzopardi, and C. J. van Rijsbergen. 2008. A formalization of logical imaging for information retrieval using quantum theory. In Proceedings of the19th International Conference on Database and Expert Systems Application (DEXA’08). IEEE Computer Society, Washington, DC, 3--8.

Cited By

View all

Index Terms

  1. Modeling Information Retrieval by Formal Logic: A Survey

    Recommendations

    Reviews

    Jolanta MizeraPietraszko

    As the title indicates, formal logic is used for modeling information retrieval (IR). Readers can expect a literature review (of IR models), supported with graphs, mathematical formulas, and examples that lead to some interesting conclusions. For those working in the area of IR, it may be a controversial assumption that "logic-based IR models upgrade the IR process from document-query comparison to an inference process," as the very first IR classical models relied just on logic-for instance, matching query to documents, query formulation, query-document matrix, definition of relevance, and so on. Semantic queries express inferences studied by the IR community without formal logic theory; therefore, its omission is a huge gap of this work. Karen Spärck Jones, a pioneer of term frequency inverse document frequency (tf-idf), is not even mentioned; Gerard Salton, "the father of IR," has only one reference. Another huge gap is the survey structure logic. Instead of developing the concepts of modeling IR, showing the progress of formal logic applications in the area over the years, the article gives mathematical formulas proposed by some researchers and chosen somehow at random. I would appreciate some examples, as well as more information about which models became popular and which went unnoticed. Readers will first learn some IR terminology in a very generic sense. The potential of this work is presented next, defined as logical models, to help understand relevance, or represent non-logical models or include domain knowledge. Logic-based models are listed with the respective authors: document and query representation, implication-based categorization, uncertainty versus multiple truth values, ranking to uncertainty, and propositional models. The limited references for the models make it difficult to see which of them have really contributed to IR over the years, and there are no graphs, diagrams, or pseudocode representations. The article may be recommended to mathematicians who like formal logic from a theoretical viewpoint only, without any vision of implementation to IR.

    Access critical reviews of Computing literature here

    Become a reviewer for Computing Reviews.

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Computing Surveys
    ACM Computing Surveys  Volume 52, Issue 1
    January 2020
    758 pages
    ISSN:0360-0300
    EISSN:1557-7341
    DOI:10.1145/3309872
    • Editor:
    • Sartaj Sahni
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 February 2019
    Accepted: 01 November 2018
    Revised: 01 August 2018
    Received: 01 September 2017
    Published in CSUR Volume 52, Issue 1

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Formal logics
    2. information retrieval models
    3. logical models
    4. survey
    5. uncertainty

    Qualifiers

    • Survey
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)268
    • Downloads (Last 6 weeks)27
    Reflects downloads up to 01 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Login options

    Full Access

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media