Earthquake Information Extraction and Comparison from Different Sources Based on Web Text
Abstract
1. Introduction
2. Materials and Methods
2.1. Data and Pre-Processing
2.2. Named Entity Recognition
2.2.1. Earthquake Information Extraction Rules
2.2.2. Information Extraction
2.3. Geocoding
2.4. Kernel Density Estimation
2.5. Evaluation Approach
3. Results
3.1. Results Description
3.2. Statistical Analysis
3.3. Kernel Density Estimation
4. Discussion
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
Temporal Type | Expressions | Examples |
---|---|---|
Absolute time | Year/Month/Day/Hour/Minute | “At 21:27 on 21 May 2018” |
Absolute time | Month/Day/Hour/Minute | “At 11:09 on May 19” |
Absolute time | Day/Hour/Minute | “At 21:10 on the 1st” |
Relative time | “today,” “afternoon,” “morning,” “today” | “1:55 this afternoon,” “8:48:22 in the morning,” “2:39 this morning” |
Temporal Type | Extraction Rules | Examples |
---|---|---|
Absolute time | \d{4}year\d{1,2}month\d{1,2}day\d{1,2}hour\d{1,2}minute | Year/Month/Day/Hour/Minute |
Absolute time | \d{1,2}month\d{1,2}day\d{1,2}hour\d{1,2}minute | Month/Day/Hour/Minute |
Absolute time | \d{1,2}day\d{1,2}hour\d{1,2}minute | Day/Hour/Minute |
Relative time | Today\|afternoon\|morning\|today | Today, afternoon, morning, today |
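
The temporal rules above can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation. It assumes that the tokens "year", "month", "day", "hour", and "minute" in the rules are English glosses of the Chinese characters 年, 月, 日, 时, and 分, and that the relative-time keywords correspond to words such as 今天, 今日, 下午, and 上午. The names `ABSOLUTE_TIME_RULES`, `RELATIVE_TIME_RULE`, and `extract_time` are hypothetical.

```python
import re

# Assumption: "year/month/day/hour/minute" in the rules are glosses of 年/月/日/时/分.
# Rules are ordered from most to least specific so the fullest match wins.
ABSOLUTE_TIME_RULES = [
    ("Year/Month/Day/Hour/Minute",
     re.compile(r"\d{4}年\d{1,2}月\d{1,2}日\d{1,2}时\d{1,2}分")),
    ("Month/Day/Hour/Minute",
     re.compile(r"\d{1,2}月\d{1,2}日\d{1,2}时\d{1,2}分")),
    ("Day/Hour/Minute",
     re.compile(r"\d{1,2}日\d{1,2}时\d{1,2}分")),
]

# Assumption: hypothetical Chinese equivalents of "today", "afternoon", "morning".
RELATIVE_TIME_RULE = re.compile(r"今天|今日|下午|上午")


def extract_time(text):
    """Return (temporal type, expression form, matched string), or None."""
    for form, pattern in ABSOLUTE_TIME_RULES:
        match = pattern.search(text)
        if match:
            return ("Absolute time", form, match.group(0))
    match = RELATIVE_TIME_RULE.search(text)
    if match:
        return ("Relative time", "keyword", match.group(0))
    return None


if __name__ == "__main__":
    for sample in ["2018年5月21日21时27分发生地震", "5月19日11时09分", "今天下午发生地震"]:
        print(extract_time(sample))
```

Trying the most specific pattern first matters because the shorter rules also match a suffix of a full year/month/day expression.
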
Location Type | Trigger Words | Examples |
---|---|---|
Locations related to earthquake | “latitude,” “north latitude,” “longitude,” “east longitude,” “occur” | China News Service, February 9, 2018 (reporter Dong Fei): According to the China Earthquake Network Center, at 19:00 on February 9, 2018, a 4.3 magnitude earthquake occurred in Xichuan County, Nanyang City, Henan Province. There are no reports of casualties for the time being. The Henan Seismological Bureau said on the same night that, according to the relevant pre-arranged plans, the second-level emergency response should be launched. The epicenter is located in Madeng town (111.60 degrees east longitude, 32.80 degrees north latitude) in the county, with a focal depth of 10 km. |
Locations unrelated to earthquake | “Seismological Bureau,” “Earthquake Networks Center,” “time,” “Networks Center” | Same example as above |
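
One way to apply the location trigger words is to label each clause of a report by the triggers it contains. The sketch below is only illustrative and works on the English glosses of the trigger words. It assumes that clauses are split on commas, semicolons, and sentence-ending periods, and that an "unrelated" trigger overrides a "related" one within the same clause. Neither assumption comes from the table, and the names `RELATED_TRIGGERS`, `UNRELATED_TRIGGERS`, and `classify_clauses` are hypothetical.

```python
import re

# Trigger words from the table (English glosses of the original terms).
RELATED_TRIGGERS = ("latitude", "longitude", "occur")
UNRELATED_TRIGGERS = ("seismological bureau", "earthquake networks center",
                      "networks center", "time")


def classify_clauses(text):
    """Label each clause as 'related' or 'unrelated' to the earthquake location.

    Assumption: clauses are separated by commas, semicolons, or a period
    followed by whitespace, so decimals such as "4.3" are not split.
    Assumption: an unrelated trigger takes precedence over a related one.
    Clauses without any trigger word are skipped.
    """
    labelled = []
    for clause in re.split(r"[,;]\s*|\.\s+", text):
        clause = clause.strip(" .")
        if not clause:
            continue
        lowered = clause.lower()
        if any(trigger in lowered for trigger in UNRELATED_TRIGGERS):
            labelled.append((clause, "unrelated"))
        elif any(trigger in lowered for trigger in RELATED_TRIGGERS):
            labelled.append((clause, "related"))
    return labelled


if __name__ == "__main__":
    report = ("According to the Seismological Bureau, a 4.3 magnitude earthquake "
              "occurred in Xichuan County. The epicenter is at 32.80 degrees "
              "north latitude.")
    for clause, label in classify_clauses(report):
        print(f"{label}: {clause}")
```

Plain substring matching keeps the sketch short but is crude: a short trigger such as "time" would also match inside longer words, so a real system would need word segmentation or stricter matching.
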
Attribute Type | Trigger Words |
---|---|
Magnitude | “magnitude unit,” “earthquake,” “magnitude,” “Richter” |
Focal depth | “focal depth,” “depth,” “deep,” “km,” “km” |
Casualties | “casualty unit,” “death,” “death,” “wounded,” “dead body,” “dead,” “dead,” “number” |
Epicenter | “epicenter,” “place,” “located,” “center” |
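
Attribute trigger words can be paired with a numeric pattern to pull values out of a sentence. The following is a minimal sketch for two of the attribute types, written against English text for readability. The patterns and the names `MAGNITUDE_RULE`, `FOCAL_DEPTH_RULE`, and `extract_attributes` are assumptions made for illustration and are not the rules used in the paper.

```python
import re

NUMBER = r"(\d+(?:\.\d+)?)"

# Assumption: the value sits directly before or after the trigger word "magnitude".
MAGNITUDE_RULE = re.compile(
    rf"{NUMBER}[ -]magnitude|magnitude[ -]{NUMBER}", re.IGNORECASE)

# Assumption: focal depth is phrased as "depth of <number> km".
FOCAL_DEPTH_RULE = re.compile(rf"depth of {NUMBER}\s*km", re.IGNORECASE)


def extract_attributes(text):
    """Return magnitude and focal depth (km) as floats, or None when absent."""
    attributes = {"magnitude": None, "focal_depth_km": None}

    magnitude = MAGNITUDE_RULE.search(text)
    if magnitude:
        # Exactly one of the two groups is filled, depending on word order.
        attributes["magnitude"] = float(magnitude.group(1) or magnitude.group(2))

    depth = FOCAL_DEPTH_RULE.search(text)
    if depth:
        attributes["focal_depth_km"] = float(depth.group(1))

    return attributes


if __name__ == "__main__":
    sentence = ("A 4.3 magnitude earthquake occurred in Xichuan County, "
                "with a focal depth of 10 km.")
    print(extract_attributes(sentence))
```
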
Levels of Earthquake Events | News Report | Professional Report | CENC |
---|---|---|---|
Ms < 4.0 | 594 | 2 | 1449 |
4.0 ≤ Ms < 5.0 | 189 | 104 | 308 |
5.0 ≤ Ms < 6.0 | 51 | 20 | 65 |
6.0 ≤ Ms < 7.0 | 15 | 6 | 15 |
Ms ≥ 7.0 | 1 | 1 | 1 |
Total | 850 | 133 | 1838 |
Metric | News Reports | Professional Reports |
---|---|---|
Precision (P) | 88.4% | 96.4% |
Recall (R) | 86.4% | 81.6% |
F1 | 87.4% | 88.4% |
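
The F1 values in this table are consistent with the standard definition of F1 as the harmonic mean of precision and recall, F1 = 2PR / (P + R). The short check below assumes that this standard definition is the one used; the function name `f1_score` is chosen here for illustration.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (both in the same unit)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the table, in percent.
reported = {
    "News Reports": (88.4, 86.4),
    "Professional Reports": (96.4, 81.6),
}

if __name__ == "__main__":
    for source, (precision, recall) in reported.items():
        print(f"{source}: F1 = {f1_score(precision, recall):.1f}%")
```

Rounded to one decimal place, this gives 87.4% for news reports and 88.4% for professional reports, matching the reported F1 row.
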
Levels of Earthquake | Beijing-Tianjin-Hebei | Taiwan |
---|---|---|
Ms < 4.0 | 35 | 10 |
4.0 ≤ Ms < 5.0 | 3 | 46 |
5.0 ≤ Ms < 6.0 | 0 | 23 |
6.0 ≤ Ms < 7.0 | 0 | 8 |
Ms ≥ 7.0 | 0 | 0 |
Total | 38 | 87 |
Levels of Earthquake | CENC | News Reports | Professional Reports |
---|---|---|---|
Ms < 4.0 | 564 | 105 | 0 |
4.0 ≤ Ms < 5.0 | 67 | 22 | 27 |
5.0 ≤ Ms < 6.0 | 10 | 8 | 7 |
6.0 ≤ Ms < 7.0 | 4 | 3 | 3 |
Ms ≥ 7.0 | 0 | 0 | 0 |
Total | 645 | 138 | 37 |
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Han, X.; Wang, J. Earthquake Information Extraction and Comparison from Different Sources Based on Web Text. ISPRS Int. J. Geo-Inf. 2019, 8, 252. https://doi.org/10.3390/ijgi8060252