research-article

Named Entity Recognition for Spoken Finnish

Authors:

Dejan Porjazovski,

Mikko KurimoAuthors Info & Claims

AI4TV '20: Proceedings of the 2nd International Workshop on AI for Smart TV Content Production, Access and Delivery

Pages 25 - 29

https://doi.org/10.1145/3422839.3423066

Published: 12 October 2020 Publication History

Abstract

In this paper we present a Bidirectional LSTM neural network with a Conditional Random Field layer on top, which utilizes word, character and morph embeddings in order to perform named entity recognition on various Finnish datasets. To overcome the lack of annotated training corpora that arises when dealing with low-resource languages like Finnish, we tried a knowledge transfer technique to transfer tags from Estonian dataset. On the human annotated in-domain Digitoday dataset, out system achieved F1 score of 84.73. On the out-of-domain Wikipedia set we got F1 score of 67.66. In order to see how well the system performs on speech data, we used two datasets containing automatic speech recognition outputs. Since we do not have true labels for those datasets, we used a rule-based system to annotate them and used those annotations as reference labels. On the first dataset which contains Finnish parliament sessions we obtained F1 score of 42.09 and on the second one which contains talks from Yle Pressiklubi we obtained F1 score of 74.54.

References

[1]

Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural language processing (almost) from scratch. Journal of machine learning research 12, Aug (2011), 2493--2537.

Digital Library

[2]

Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arisoy, Murat Saraçlar, and Andreas Stolcke. 2007. Morph-based speech recognition and modeling of out-of-vocabulary words across languages. ACM Transactions on Speech and Language Processing (TSLP) 5, 1 (2007), 3.

[3]

Dimitra Farmakiotou, Vangelis Karkaletsis, John Koutsias, George Sigletos, Constantine D Spyropoulos, and Panagiotis Stamatopoulos. 2000. Rule-based named entity recognition for Greek financial texts. In Proceedings of the Workshop on Computational lexicography and Multimedia Dictionaries (COMLEX 2000). 75--78.

[4]

Xiaocheng Feng, Xiachong Feng, Bing Qin, Zhangyin Feng, and Ting Liu. 2018. Improving Low Resource Named Entity Recognition using Cross-lingual Knowledge Transfer. In IJCAI. 4071--4077.

[5]

Onur Güngör, Suzan Üsküdarl?, and Tunga Güngör. 2018. Improving Named Entity Recognition by Jointly Learning to Disambiguate Morphological Tags. arXiv preprint arXiv:1807.06683 (2018).

[6]

Teemu Hirsimaki, Janne Pylkkonen, and Mikko Kurimo. 2009. Importance of high-order n-gram models in morph-based speech recognition. IEEE Transactions on Audio, Speech, and Language Processing 17, 4 (2009), 724--732.

Digital Library

[7]

Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735--1780.

[8]

Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991 (2015).

[9]

Kentaro Torisawa. 2008. Inducing gazetteers for named entity recognition by large-scale clustering of dependency relations. In proceedings of ACL-08: HLT. 407--415.

[10]

Onur Kuru, Ozan Arkan Can, and Deniz Yuret. 2016. Charner: Character-level named entity recognition. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. 911--921.

[11]

John Lafferty, Andrew McCallum, and Fernando CN Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. (2001).

[12]

Liyuan Liu, Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, and Jiawei Han. 2019. On the Variance of the Adaptive Learning Rate and Beyond. arXiv preprint arXiv:1908.03265 (2019).

[13]

Xuezhe Ma and Eduard Hovy. 2016. End-to-end sequence labeling via bidirectional lstm-cnns-crf. arXiv preprint arXiv:1603.01354 (2016).

[14]

Andrew McCallum and Wei Li. 2003. Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003-Volume 4. Association for Computational Linguistics, 188--191.

Digital Library

[15]

Lance A Ramshaw and Mitchell P Marcus. 1999. Text chunking using transformation-based learning. In Natural language processing using very large corpora. Springer, 157--176.

[16]

Teemu Ruokolainen, Pekka Kauppinen, Miikka Silfverberg, and Krister Lindén. 2019. A Finnish news corpus for named entity recognition. Language Resources and Evaluation (2019), 1--26.

[17]

Cicero Nogueira dos Santos and Victor Guimaraes. 2015. Boosting named entity recognition with neural character embeddings. arXiv preprint arXiv:1505.05008 (2015).

[18]

Burr Settles. 2004. Biomedical named entity recognition using conditional random fields and rich feature sets. In Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications (NLPBA/BioNLP). 107--110.

Digital Library

[19]

Vesa Siivola, Teemu Hirsimaki, Mathias Creutz, and Mikko Kurimo. 2003. Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner. In Eighth European Conference on Speech Communication and Technology.

[20]

Peter Smit, Sami Virpioja, Stig-Arne Grönroos, and Mikko Kurimo. 2014. Morfessor 2.0: Toolkit for statistical morphological segmentation. In Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics. 21--24.

[21]

Peter Smit, Sami Virpioja, Mikko Kurimo, et al. 2017. Improved Subword Modeling for WFST-Based Speech Recognition. In INTERSPEECH. 2551--2555.

[22]

Cornelis Joost Van Rijsbergen. 1979. Information retrieval. (1979).

[23]

Jiateng Xie, Zhilin Yang, Graham Neubig, Noah A Smith, and Jaime Carbonell. 2018. Neural cross-lingual named entity recognition with minimal resources. arXiv preprint arXiv:1808.09861 (2018).

Cited By

Braun SStarr KDelfani JTiittula LLaaksonen JBraeckman KVan Rijsselbergen DLagrillière SSaarikoski L(2021)When Worlds Collide: AI-Created, Human-Mediated Video Description Services and the User ExperienceHCI International 2021 - Late Breaking Papers: Cognition, Inclusion, Learning, and Culture10.1007/978-3-030-90328-2_10(147-167)Online publication date: 13-Nov-2021
https://doi.org/10.1007/978-3-030-90328-2_10
Porjazovski DLeinonen JKurimo M(2021)Attention-Based End-to-End Named Entity Recognition from SpeechText, Speech, and Dialogue10.1007/978-3-030-83527-9_40(469-480)Online publication date: 30-Aug-2021
https://doi.org/10.1007/978-3-030-83527-9_40
Troncy RLaaksonen JTavakoli HNixon LMezaris VHosseini MWen Chen CCucchiara RHua XQi GRicci EZhang ZZimmermann R(2020)AI4TV 2020Proceedings of the 28th ACM International Conference on Multimedia10.1145/3394171.3421894(4756-4757)Online publication date: 12-Oct-2020
https://dl.acm.org/doi/10.1145/3394171.3421894

Index Terms

Named Entity Recognition for Spoken Finnish
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing

Recommendations

Learning multilingual named entity recognition from Wikipedia

We automatically create enormous, free and multilingual silver-standard training annotations for named entity recognition (ner) by exploiting the text and structure of Wikipedia. Most ner systems rely on statistical models of annotated data to identify ...
Two-stage approach to named entity recognition using Wikipedia and DBpedia
IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication

In natural language understanding, extraction of named entity (NE) mentions in given text and classification of the mentions into pre-defined NE types are important processes. Most NE recognition (NER) relies on resources such as a training corpus or NE ...
Unsupervised biomedical named entity recognition

Display Omitted BM-NER is approached by an unsupervised stepwise method.Noun phrase chunking is a good approximation of boundary detection.Distributional semantics works well in classifying entities.The system performs well on clinical and biological ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

AI4TV '20: Proceedings of the 2nd International Workshop on AI for Smart TV Content Production, Access and Delivery

October 2020

50 pages

ISBN:9781450381468

DOI:10.1145/3422839

Program Chairs:
Raphaël Troncy
EURECOM, France
,
Jorma Laaksonen
Aalto University, Finland
,
Hamed R.-Tavakoli
Nokia Technologies, Finland
,
Lyndon Nixon
MODUL Technology GmbH, Austria
,
Vasileios Mezaris
CERTH-ITI, Greece
,
Mohammad Hosseini
Comcast, USA

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 October 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Academy of Finland
MeMAD

Conference

MM '20

Sponsor:

SIGMM

MM '20: The 28th ACM International Conference on Multimedia

October 12, 2020

WA, Seattle, USA

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
103
Total Downloads

Downloads (Last 12 months)8
Downloads (Last 6 weeks)3

Reflects downloads up to 26 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Braun SStarr KDelfani JTiittula LLaaksonen JBraeckman KVan Rijsselbergen DLagrillière SSaarikoski L(2021)When Worlds Collide: AI-Created, Human-Mediated Video Description Services and the User ExperienceHCI International 2021 - Late Breaking Papers: Cognition, Inclusion, Learning, and Culture10.1007/978-3-030-90328-2_10(147-167)Online publication date: 13-Nov-2021
https://doi.org/10.1007/978-3-030-90328-2_10
Porjazovski DLeinonen JKurimo M(2021)Attention-Based End-to-End Named Entity Recognition from SpeechText, Speech, and Dialogue10.1007/978-3-030-83527-9_40(469-480)Online publication date: 30-Aug-2021
https://doi.org/10.1007/978-3-030-83527-9_40
Troncy RLaaksonen JTavakoli HNixon LMezaris VHosseini MWen Chen CCucchiara RHua XQi GRicci EZhang ZZimmermann R(2020)AI4TV 2020Proceedings of the 28th ACM International Conference on Multimedia10.1145/3394171.3421894(4756-4757)Online publication date: 12-Oct-2020
https://dl.acm.org/doi/10.1145/3394171.3421894

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten