Open Access
Description:
Natural language understanding is an important topic in natural language proces- sing. Given a text, a computer program should, at the very least, be able to under- stand what the text is about, and ideally also situate it in its extra-textual context and understand what purpose it serves. What exactly it means to understand what a text is about is an open question, but it is generally accepted that, at a minimum, un- derstanding involves being able to answer questions like “Who did what to whom? Where? When? How? And Why?”. Entity analysis, the computational analysis of entities mentioned in a text, aims to support answering the questions “Who?” and “Whom?” by identifying entities mentioned in a text. If the answers to “Where?” and “When?” are specific, named locations and events, entity analysis can also pro- vide these answers. Entity analysis aims to answer these questions by performing entity linking, that is, linking mentions of entities to their corresponding entry in a knowledge base, coreference resolution, that is, identifying all mentions in a text that refer to the same entity, and entity typing, that is, assigning a label such as Person to mentions of entities. In this thesis, we study how different aspects of coherence can be exploited to improve entity analysis. Our main contribution is a method that allows exploiting knowledge-rich, specific aspects of coherence, namely geographic, temporal, and entity type coherence. Geographic coherence expresses the intuition that entities mentioned in a text tend to be geographically close. Similarly, temporal coherence captures the intuition that entities mentioned in a text tend to be close in the tem- poral dimension. Entity type coherence is based in the observation that in a text about a certain topic, such as sports, the entities mentioned in it tend to have the same or related entity types, such as sports team or athlete. We show how to integrate features modeling these aspects of coherence into entity linking systems and esta- blish their utility in ...
Year of Publication:
2019
Document Type:
Dissertation ; info:eu-repo/semantics/doctoralThesis ; NonPeerReviewed ; [Doctoral and postdoctoral thesis]
Language:
eng
Subjects:
ddc-004 ; 004 Data processing Computer science
Rights:
info:eu-repo/semantics/openAccess ; http://archiv.ub.uni-heidelberg.de/volltextserver/help/license_urhg.html
Relations:
https://archiv.ub.uni-heidelberg.de/volltextserverhttps://archiv.ub.uni-heidelberg.de/volltextser... ; doi:10.11588/heidok.00026117 ; urn:nbn:de:bsz:16-heidok-261173 ; Heinzerling, Benjamin (2019) Aspects of Coherence for Entity Analysis. [Dissertation]
Content Provider:
Universität Heidelberg: HeiDok (Heidelberger Dokumentenserver)
Further nameHeidelberg University: HeiDok  Flag of Germany
Loading ...
Loading ...
Loading ...