skip to main content
10.1145/3459637.3481924acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

GEDIT: Geographic-Enhanced and Dependency-Guided Tagging for Joint POI and Accessibility Extraction at Baidu Maps

Published: 30 October 2021 Publication History

Abstract

Providing timely accessibility reminders (such as closed and relocated) of a point-of-interest (POI) plays a vital role in improving user satisfaction of finding places and making visiting decisions. However, it is difficult to keep the POI database in sync with the real-world counterparts due to the dynamic nature of business changes and innovations. To alleviate this problem, we formulate and present a practical solution that jointly extracts POI mentions and identifies their coupled accessibility labels from unstructured text (hereafter referred to as joint POI and accessibility extraction). We approach this task as a sequence tagging problem, where the goal is to produce (POI name, accessibility label) pairs from unstructured text. This task is challenging because of two main issues: (1) POI names are often newly-coined words so as to successfully register new entities or brands and (2) there may exist multiple pairs in the text, which necessitates dealing with one-to-many or many-to-one mapping to make each POI coupled with its matching accessibility label. To this end, we propose a Geographic-Enhanced and Dependency-guIded sequence Tagging (GEDIT) model to concurrently address the two challenges. First, to alleviate challenge #1, we develop a geographic-enhanced pre-trained model to learn the text representations, which is able to significantly relieve the problem of newly-coined words. Second, to mitigate challenge #2, we apply a relational graph convolutional network to learn the tree node representations from the parsed dependency tree, which enables us to establish a correlation between a POI and its accessibility label. Finally, we construct a neural sequence tagging model by integrating and feeding the previously pre-learned representations into a CRF layer. Extensive experiments conducted on a real-world dataset demonstrate the superiority and effectiveness of GEDIT. In addition, it has already been deployed in production at Baidu Maps, and it successfully keeps processing hundreds of thousands of Web documents every week. Statistics show that the proposed solution can save significant human effort and labor costs to deal with the same amount of documents, which confirms that it is a practical way for POI accessibility maintenance.

References

[1]
Abien Fred Agarap. 2018. Deep Learning using Rectified Linear Units (ReLU). arXiv preprint arXiv:1803.08375 (2018).
[2]
Dirk Ahlers. 2013. Business Entity Retrieval and Data Provision for Yellow Pages by Local Search. In IRPS Workshop @ ECIR2013.
[3]
Jason P. C. Chiu and Eric Nichols. 2016. Named Entity Recognition with Bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguistics, Vol. 4 (2016), 357--370.
[4]
Hsiu-Min Chuang, Chia-Hui Chang, Ting-Yao Kao, Chung-Ting Cheng, Ya-Yun Huang, and Kuo-Pin Cheong. 2016. Enabling Maps/Location Searches on Mobile Devices: Constructing a POI Database via Focused Crawling and Information Extraction. Int. J. Geogr. Inf. Sci., Vol. 30, 7 (2016), 1405--1425.
[5]
Hsiu-Min Chuang, Chia-Hui Chang, and Wang-Chien Lee. 2018. Detecting Outdated POI Relations via Web-Derived Features. Trans. GIS, Vol. 22, 5 (2018), 1238--1256.
[6]
Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel P. Kuksa. 2011. Natural Language Processing (Almost) from Scratch. J. Mach. Learn. Res., Vol. 12 (2011), 2493--2537.
[7]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 4171--4186.
[8]
Miao Fan, Yibo Sun, Jizhou Huang, Haifeng Wang, and Ying Li. 2021. Meta-Learned Spatial-Temporal POI Auto-Completion for the Search Engine at Baidu Maps. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2822--2830.
[9]
Xiaomin Fang, Jizhou Huang, Fan Wang, Lihang Liu, Yibo Sun, and Haifeng Wang. 2021. SSML: Self-Supervised Meta-Learner for En Route Travel Time Estimation at Baidu Maps. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2840--2848.
[10]
Xiaomin Fang, Jizhou Huang, Fan Wang, Lingke Zeng, Haijin Liang, and Haifeng Wang. 2020. ConSTGAT: Contextual Spatial-Temporal Graph Attention Network for Travel Time Estimation at Baidu Maps. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2697--2705.
[11]
Radu Florian, Abraham Ittycheriah, Hongyan Jing, and Tong Zhang. 2003. Named Entity Recognition through Classifier Combination. In Proceedings of the Seventh Conference on Natural Language Learning. 168--171.
[12]
G David Forney. 1973. The Viterbi Algorithm. Proc. IEEE, Vol. 61, 3 (1973), 268--278.
[13]
Tao Gui, Ruotian Ma, Qi Zhang, Lujun Zhao, Yu-Gang Jiang, and Xuanjing Huang. 2019. CNN-Based Chinese NER with Lexicon Rethinking. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence. 4982--4988.
[14]
Jizhou Huang, Haifeng Wang, Miao Fan, An Zhuo, and Ying Li. 2020 a. Personalized Prefix Embedding for POI Auto-Completion in the Search Engine of Baidu Maps. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2677--2685.
[15]
Jizhou Huang, Haifeng Wang, Yibo Sun, Miao Fan, Zhengjie Huang, Chunyuan Yuan, and Yawen Li. 2021. HGAMN: Heterogeneous Graph Attention Matching Network for Multilingual POI Retrieval at Baidu Maps. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 3032--3040.
[16]
Jizhou Huang, Haifeng Wang, Wei Zhang, and Ting Liu. 2020 b. Multi-Task Learning for Entity Recommendation and Document Ranking in Web Search. ACM Transactions on Intelligent Systems and Technology (TIST), Vol. 11, 5 (2020), 1--24.
[17]
Jizhou Huang, Wei Zhang, Shiqi Zhao, Shiqiang Ding, and Haifeng Wang. 2017. Learning to Explain Entity Relationships by Pairwise Ranking with Convolutional Neural Networks. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. 4018--4025.
[18]
Jizhou Huang, Shiqi Zhao, Shiqiang Ding, Haiyang Wu, Mingming Sun, and Haifeng Wang. 2016. Generating Recommendation Evidence Using Translation Model. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. 2810--2816.
[19]
Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF Models for Sequence Tagging. arXiv preprint arXiv:1508.01991 (2015).
[20]
Yoon Kim. 2014. Convolutional Neural Networks for Sentence Classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 1746--1751.
[21]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations.
[22]
John D. Lafferty, Andrew McCallum, and Fernando C. N. Pereira. 2001. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In Proceedings of the Eighteenth International Conference on Machine Learning. 282--289.
[23]
Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. 2016. Neural Architectures for Named Entity Recognition. In The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 260--270.
[24]
Xuezhe Ma and Eduard H. Hovy. 2016. End-to-End Sequence Labeling via Bi-directional LSTM-CNNs-CRF. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. 1064--1074.
[25]
Lakshmi Narayana Mummidi and Krumm John. 2008. Discovering Points of Interest From Users' Map Annotations. GeoJournal, Vol. 72, 3 (August 2008), 215--227.
[26]
Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. Glove: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 1532--1543.
[27]
Matthew E. Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, and Luke Zettlemoyer. 2018. Deep Contextualized Word Representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2227--2237.
[28]
Adam Rae, Vanessa Murdock, Adrian Popescu, and Hugues Bouchard. 2012. Mining the Web for Points of Interest. In The 35th International ACM SIGIR conference on research and development in Information Retrieval. 711--720.
[29]
Lev-Arie Ratinov and Dan Roth. 2009. Design Challenges and Misconceptions in Named Entity Recognition. In Proceedings of the Thirteenth Conference on Computational Natural Language Learning. 147--155.
[30]
Jé rô me Revaud, Matthijs Douze, and Cordelia Schmid. 2012. Correlation-based Burstiness for Logo Retrieval. In Proceedings of the 20th ACM Multimedia Conference. 965--968.
[31]
Jé rô me Revaud, Minhyeok Heo, Rafael S. Rezende, Chanmi You, and Seong-Gyun Jeong. 2019. Did It Change? Learning to Detect Point-Of-Interest Changes for Proactive Map Updates. In IEEE Conference on Computer Vision and Pattern Recognition. 4086--4095.
[32]
Stefan Romberg, Lluis Garcia Pueyo, Rainer Lienhart, and Roelof van Zwol. 2011. Scalable Logo Recognition in Real-world Images. In Proceedings of the 1st International Conference on Multimedia Retrieval. 25.
[33]
Michele Ruta, Floriano Scioscia, Saverio Ieva, Giuseppe Loseto, and Eugenio Di Sciascio. 2012. Semantic Annotation of OpenStreetMap Points of Interest for Mobile Discovery and Navigation. In 2012 IEEE First International Conference on Mobile Services. 33--39.
[34]
Michael Sejr Schlichtkrull, Thomas N. Kipf, Peter Bloem, Rianne van den Berg, Ivan Titov, and Max Welling. 2018. Modeling Relational Data with Graph Convolutional Networks. In The Semantic Web - 15th International Conference (Lecture Notes in Computer Science), Vol. 10843. 593--607.
[35]
Hang Su, Shaogang Gong, and Xiatian Zhu. 2017a. WebLogo-2M: Scalable Logo Detection by Deep Learning from the Web. In 2017 IEEE International Conference on Computer Vision Workshops. 270--279.
[36]
Hang Su, Xiatian Zhu, and Shaogang Gong. 2017b. Deep Learning Logo Detection with Data Expansion by Synthesising Context. In 2017 IEEE Winter Conference on Applications of Computer Vision. 530--539.
[37]
Yu Sun, Shuohuan Wang, Yukun Li, Shikun Feng, Xuyi Chen, Han Zhang, Xin Tian, Danxiang Zhu, Hao Tian, and Hua Wu. 2019. ERNIE: Enhanced Representation through Knowledge Integration. ArXiv, Vol. abs/1904.09223 (2019).
[38]
Fangzhao Wu, Junxin Liu, Chuhan Wu, Yongfeng Huang, and Xing Xie. 2019. Neural Chinese Named Entity Recognition via CNN-LSTM-CRF and Joint Training with Word Segmentation. In The World Wide Web Conference. 3342--3348.
[39]
Canwen Xu, Jing Li, Xiangyang Luo, Jiaxin Pei, Chenliang Li, and Donghong Ji. 2019. DLocRL: A Deep Learning Pipeline for Fine-Grained Location Recognition and Linking in Tweets. In The World Wide Web Conference. 3391--3397.
[40]
Meng Zhou, Ming Wang, and Qingwu Hu. 2013. A POI Data Update Approach based on Weibo Check-in Data. In 21st International Conference on Geoinformatics. 1--4.

Cited By

View all

Index Terms

  1. GEDIT: Geographic-Enhanced and Dependency-Guided Tagging for Joint POI and Accessibility Extraction at Baidu Maps

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management
    October 2021
    4966 pages
    ISBN:9781450384469
    DOI:10.1145/3459637
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 October 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. baidu maps
    2. geographic-enhanced sequence tagging
    3. joint poi and accessibility extraction
    4. pre-trained model

    Qualifiers

    • Research-article

    Conference

    CIKM '21
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)39
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 06 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media