A Knowledge-Driven Geospatially Enabled Framework for Geological Big Data
Abstract
:1. Introduction
2. Architecture of the GSISSP
3. Organization of Multiple-Classification Content Based on Geologic Domain Ontology
3.1. Organization Patterns of Complex Geological Unstructured Content
3.2. Storage Pattern of Complex Geological Unstructured Content
4. Fragmented and Diversified Content Discovery
4.1. Question-Oriented Content Retrieval Framework
- Domain experts build and adjust the geological thematic ontology, geological temporal ontology and retrieval frame ontology;
- We convert the original unstructured data to content items and extract the features of each item, and then we reorganize the content items to HBase and re-store the spatial data to HDFS.
- Submit a specific class of a geological question according to the retrieval portal, for example, the volcanic activity in Xinjiang, China;
- Key information is extracted from the proposed question, such as the question type, target area, or target theme;
- Find the corresponding rules from the predefined SWRL database according to the question type;
- The SWRL rules are input into the inference machine;
- The inference machine and SWRL rules bind with the geological domain ontology to start thematic reasoning and discovery;
- If related content indices are discovered from the ontology, then the source unstructured data would be retrieved from HBase; if spatial data indices are discovered, then the source spatial data would be selected from HDFS and added to the GIS platform to facilitate the spatial analytical work. After related map documents or unstructured content are retrieved from the data source, the map document would be published as a map service on the IGServer platform;
- Newly published map services are associated with the Web Service Ontology;
- The inference machine and SWRL rules bind with the retrieval frame ontology to start the work flow reasoning and discovery, and then the conceptual work flow is built and expressed in a specific format;
- The conceptual workflow is transmitted to the workflow engine and the spatial analytical work is initiated;
- The spatial analytical results are transferred to the visualization tools in the application layer;
- Visualization tools render the original result and show the final analytical result.
4.2. Ontology Design
4.2.1. Geological Domain Ontology
4.2.2. Retrieval Frame Ontology
- Question Ontology:The question ontology defines the conceptual model of the geological question. All the geological questions must contain two elements: the question type and the target content. The question ontology is associated with the question type ontology and data ontology; the objective is to verify both the geological question type and the related data content that are queried by the users. To obtain knowledgeable or semantic information that is related to the target content, the question ontology is associated with the geological domain ontology to more comprehensively render the information discovery.
- Question Type Ontology:The question type ontology describes the concrete question types. Two main types of questions exist in the current framework: spatial-related questions and non-spatial-related questions. Different spatial analysis operations must be employed to answer a spatial-related question according to the different questions. For example, whatNear and whereIn are two spatial-related questions, and the whatNear type of question can be semantically expressed as “what is near a target area”. This type of question semantically implies a buffer analysis operation. The whereIn type of question can be semantically expressed as “where are the objects in the target area”. This type of question semantically implies an overlay analysis operation. Spatial questions are associated with the spatial analysis ontology to discover the related spatial analysis operations. Non-spatial-related questions are different from spatial-related questions. To answer these questions, the target content must be retrieved from the database without undertaking a spatial analysis. Consequently, non-spatial questions do not have to be associated with the spatial analysis ontology.
- Geological Domain Ontology:The geological domain ontology provides the domain knowledge and information to achieve knowledgeable and semantic discoveries in the retrieval framework. According to the geological domain ontology, the keywords that are contained in the retrieval question can be obtained. In addition, the equivalent words and hyponyms of these keywords can also be obtained according to the relations that are contained in the geological domain ontology. During the retrieval, equivalent words and hyponym words will also be retrieved, which could guarantee comprehensiveness with respect to the semantics and knowledge. In the geological domain ontology, knowledgeable information is obtained by invoking a related web service, so the geological domain ontology is associated with the web service ontology.
- Data Ontology:The data ontology describes the involved data content contained in the geological questions. There are two types of data in the framework: spatial data and unstructured data. Although spatial data contain multiple types of geological thematic data, unstructured data contain various geological pictures and text. All the data access operations are implemented by a web service, so the data ontology is associated with the web service ontology.
- Spatial Analysis Ontology:The spatial analysis ontology describes all of the common spatial analysis functions, and it semantically annotates the associated input, output and algorithm model of these spatial analysis functions. The spatial analysis ontology is connected to the process ontology.
- Process Ontology:The process ontology describes the execution standard of the spatial analysis web services and annotates the input and output of these services. This ontology is connected to the related spatial analysis services.
- GIS Data Model Ontology:The GIS data model ontology describes the organization model of the spatial data. All of the spatial processing work will finally be concentrated on the feature processing, as each spatial feature contains attributes, references and geometries. Here, we utilize the spatial data organization model of MapGIS 10, and the GIS data model ontology inherits the ontology in the GSICCP [49].
- Content Discovery Ontology:The content discovery ontology describes the process function that does not involve spatial data, and it currently includes image discovery and text discovery functions. This ontology is implemented by a web service and is connected to the related web services.
- Web Service Ontology:The web service ontology describes all of the web services in the GSICCP, including spatial-related web services and non-spatial-related web services. Although the spatial-related web services mainly consist of the OGC service (WPS, WFS, etc.) and the IGServer service (the map document service, tile map service, vector layer service, etc.), non-spatial-related web services mainly consist of unstructured content discovery services and some data-mining services.
- Rule 1: define a formal question composition, which contains a question type and required data.
- Rule 2: define a HowMany question; a question that is marked with “HowMany” could be regarded as a “HowMany” question.
- Rule 3: define the execution sequence of the HowMany question, which requires an overlay and spatial query operation; the spatial query operation follows the overlay operation.
- Rule 4: define the function’s connections; the output of the previous function is the input of its subsequent functions.
- Rule 5: define the requirement of identical data; two identical spatial data set must have identical geometries and spatial reference systems.
- Rule1:hasQuestionType(?type) ∧ hasData(?type) →Question(?type)
- Rule2:HasQuestionType(“How Many”) ∧ hasData?(?type) →HowManyType(?type)
- Rule3:HowManyType(?type) ∧ Overlay(?overlay) ∧ SpatialQuery(?spatialQuery) ∧hasGISFunction(?type, ?overlay) ∧ hasGISFunction(?type, ?spatialQuery) ∧hasNextFunction(?overlay, ?spatialQuery) →howMany(?type)
- Rule4:GISFunction(?func1) ∧ Output(?out1) ∧ hasOutput(?func1, ?out1) ∧GISFunction(?func2) ∧ Input(?in1) ∧ hasInput(?func2, ?in1) ∧hasSameData(?out1, ?in1) →hasNextFunction(?func1, ?func2)
- Rule5:FeatureCollection(?data1) ∧ FeatureCollection(?data2) ∧ Geometry(?geo1) ∧Geometry(?geo2) ∧ hasGeometryType(?data1, ?geo1) ∧ hasGeometryType(?data2, ?geo2) ∧hasSRS(?geo1, ?srs1) ∧ hasSRS(?geo2, ?srs2) ∧ sameAs(?geo1, ?geo2) ∧ sameAs(?srs1, ?srs2) →hasSameData(?data1, ?data2)
4.3. Feature Extraction Based on the Geological Domain Ontology
4.3.1. Thematic Feature Extraction
4.3.2. Spatiotemporal Feature Extraction
4.4. Multi-Feature Content Association
5. Use Case
- Rule1:hasQuestionType(“WhereIn”) ∧ hasData(?type) →WhereIn(?type)
- Rule2:WhereIn(?type) ∧ Overlay(?overlay) ∧ SpatialQuery(?spatialQuery) ∧hasGISFunction(?type, ?overlay) ∧ hasGISFunction(?type, ?spatialQuery) ∧hasNextFunction(?overlay, ?spatialQuery) →whereIn(?type)
6. Discussion
- Massive unstructured complex geological data organizationIn this study, we aimed to characterize geological survey data (e.g., massive, complex, diversified, or unstructured) and enhanced the data-management methods with Hadoop ecosystem technologies. HDFS was utilized to store the original geological content. Instead of using a traditional file system or relational database, the original geological content in HDFS had multiple copies. Compared to traditional file-based storage, the use of the distributed architecture of the Hadoop system technology may have improved the security of the original data and improved the concurrent data access efficiency. In addition, massive geological unstructured data were split into fragments, and a NoSQL database was utilized to reorganize these fragmented contents. Split unstructured data could help reduce the complexity and heterogeneity of original geological data. Therefore, we fully exploited the NoSQL database features to efficiently manage the fragmented geological content, which improved the retrieval and computational efficiency of the geological unstructured data.
- Constructing geological knowledge relationshipsIn this paper, we addressed the thematic, spatial and temporal features of the geological domain, introduced a geological thematic ontology and geological temporal ontology, built a toponymy ontology, and extracted multi-dimensional features from geological unstructured data. The geological domain problem usually contains spatiotemporal characteristics, and the extracted thematic, spatial and temporal features would be advantageous to geological content discovery work that considers spatiotemporal attributes. Moreover, associations between the fragments, spatial data and images were built based on the relationships that were represented in the ontologies and the extracted features, and these associations subsequently produced intelligent and semantically driven connections among the geological content. This set of associations built the foundation for knowledge-driven geological content discovery.
- Knowledge-driven geological content discoveryMany unstructured content retrieval studies have been performed in the past. However, in our previous work, we discovered geological content according to keywords in the text by extracting words from the text and building an index between the words and the content. When a query request is sent, the index of keywords is used to identify all the fragments that contain the keywords. However, this strategy has a huge gap: keyword-based searching ignores semantic and knowledge relationships. Thus, certain concepts that have the same semantics but are expressed in different forms are lost, which could lead to missing content during the retrieval process. For example, to find “magmatic” content, the keyword-based searching method finds all the fragments that contain the string “magmatic”. However, many sub-categories under the concept of “magmatic”, such as basalt, dacite, and tuff, exist in the geological domain. Although these concepts also belong to the category of “magmatic” in the geological domain ontology, they do not contain the keyword “magmatic” in their expressions; thus, fragments that contain these concepts are not discovered.
7. Future work
Acknowledgments
Author Contributions
Conflicts of Interest
References
- O’Driscoll, A.; Daugelaite, J.; Sleator, R.D. “Big data”, hadoop and cloud computing in genomics. J. Biomed. Inform. 2013, 46, 774–781. [Google Scholar] [CrossRef] [PubMed]
- Evangelidis, K.; Ntouros, K.; Makridis, S.; Papatheodorou, C. Geospatial services in the cloud. Comput. Geosci. 2014, 63, 116–122. [Google Scholar] [CrossRef]
- Sharma, S. Expanded cloud plumes hiding big data ecosystem. Future Gener. Comput. Syst. 2016, 59, 63–92. [Google Scholar] [CrossRef]
- Yang, C.; Yu, M.; Hu, F.; Jiang, Y.; Li, Y. Utilizing cloud computing to address big geospatial data challenges. Comput. Environ. Urban Syst. 2017, 61, 120–128. [Google Scholar] [CrossRef]
- Linh Manh, P.; El-Rheddane, A.; Donsez, D.; de Palma, N. Cirus: An elastic cloud-based framework for ubilytics. Ann. Telecommun. 2016, 71, 133–140. [Google Scholar]
- Vera-Baquero, A.; Colomo-Palacios, R.; Molloy, O. Real-time business activity monitoring and analysis of process performance on big-data domains. Telemat. Inform. 2016, 33, 793–807. [Google Scholar] [CrossRef]
- Wylot, M.; Cudre-Mauroux, P. Diplocloud: Efficient and scalable management of rdf data in the cloud. IEEE Trans. Knowl. Data Eng. 2016, 28, 659–674. [Google Scholar] [CrossRef]
- Xia, J.; Yang, C.; Liu, K.; Li, Z.; Sun, M.; Yu, M. Forming a global monitoring mechanism and a spatiotemporal performance model for geospatial services. Int. J. Geogr. Inf. Sci. 2015, 29, 375–396. [Google Scholar] [CrossRef]
- Giachetta, R. A framework for processing large scale geospatial and remote sensing data in mapreduce environment. Comput Graph. 2015, 49, 37–46. [Google Scholar] [CrossRef]
- Oweis, N.E.; Owais, S.S.; George, W.; Suliman, M.G.; Snasel, V. A survey on big data, mining: (tools, techniques, applications and notable uses). In Intelligent Data Analysis and Applications; Abraham, A., Jiang, X.H., Snasel, V., Pan, J.S., Eds.; Springer International Publishing: Cham, Switzerland, 2015; Volume 370, pp. 109–119. [Google Scholar]
- Lomotey, R.K.; Deters, R. Towards knowledge discovery in big data. In Proceedings of the IEEE 8th International Symposium on Service Oriented System Engineering, Oxford, UK, 7–11 April 2014; IEEE: Oxford, UK, 2014; pp. 181–191. [Google Scholar]
- Kim, G.-H.; Trimi, S.; Chung, J.-H. Big-data applications in the government sector. Commun. ACM 2014, 57, 78–85. [Google Scholar] [CrossRef]
- Yang, C.; Huang, Q.; Li, Z.; Liu, K.; Hu, F. Big data and cloud computing: Innovation opportunities and challenges. Int. J. Digit. Earth 2017, 10, 13–53. [Google Scholar] [CrossRef]
- Bhogal, J.; Choksi, I. Handling big data using NoSQL. In Proceedings of the 29th IEEE International Conference on Advanced Information Networking and Applications Workshops, Gwangju, Korea, 24–27 March 2015; IEEE: Gwangju, Korea, 2015; pp. 393–398. [Google Scholar]
- Lomotey, R.K.; Deters, R. Terms mining in document-based NoSQL: Response to unstructured data. In Proceedings of the 3rd IEEE International Congress on Big Data, BigData Congress, Anchorage, AK, USA, 27 June–2 July 2014; IEEE: Anchorage, AK, USA, 2014; pp. 661–668. [Google Scholar]
- Mazurek, M. Applying nosql databases for operationalizing clinical data mining models. In Beyond Databases, Architectures and Structures; Kozielski, S., Mrozek, D., Kasprowski, P., MalysiakMrozek, B., Kostrzewa, D., Eds.; Springer: Berlin, Germany, 2014; Volume 424, pp. 527–536. [Google Scholar]
- Lomotey, R.K.; Deters, R. Unstructured data extraction in distributed NoSQL. In Proceedings of the 7th IEEE International Conference on Digital Ecosystems and Technologies: Smart Planet and Cyber Physical Systems as Embodiment of Digital Ecosystems, Menlo Park, CA, USA, 24–26 July 2013; IEEE: Menlo Park, CA, USA, 2013; pp. 160–165. [Google Scholar]
- Lomotey, R.K.; Deters, R. Topics and terms mining in unstructured data stores. In Proceedings of the IEEE 16th International Conference on Computational Science and Engineering, Sydney, Australia, 3–5 December 2013; Chen, J., Cuzzocrea, A., Yang, L.T., Eds.; IEEE: Sydney, Australia, 2013; pp. 854–861. [Google Scholar]
- Lomotey, R.K.; Deters, R. Real-time effective framework for unstructured data mining. In Proceedings of the 2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, Melbourne, Australia, 16–18 July 2013; pp. 1081–1088. [Google Scholar]
- Witayangkurn, A.; Horanont, T.; Shibasaki, R. The design of large scale data management for spatial analysis on mobile phone dataset. Asian J. Geoinform. 2013, 13, 3. [Google Scholar]
- Zhao, J.; Wang, L.; Tao, J.; Chen, J.; Sun, W.; Ranjan, R.; Kołodziej, J.; Streit, A.; Georgakopoulos, D. A security framework in g-hadoop for big data computing across distributed cloud data centres. J. Comput. Syst. Sci. 2014, 80, 994–1007. [Google Scholar] [CrossRef]
- Zhong, Y.; Han, J.; Zhang, T.; Li, Z.; Fang, J.; Chen, G. Towards parallel spatial query processing for big spatial data. In Proceedings of the 2012 IEEE 26th International on Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), Shanghai, China, 21–25 May 2012; pp. 2085–2094. [Google Scholar]
- Aji, A.; Wang, F.; Vo, H.; Lee, R.; Liu, Q.; Zhang, X.; Saltz, J. Hadoop gis: A high performance spatial data warehousing system over mapreduce. Proc. VLDB Endow. 2013, 6, 1009–1020. [Google Scholar] [CrossRef]
- Eldawy, A.; Mokbel, M.F. A demonstration of spatialhadoop: An efficient mapreduce framework for spatial data. Proc. VLDB Endow. 2013, 6, 1230–1233. [Google Scholar] [CrossRef]
- Zou, Z.Q.; Wang, Y.; Cao, K.; Qu, T.S.; Wang, Z.M. Semantic overlay network for large-scale spatial information indexing. Comput. Geosci. 2013, 57, 208–217. [Google Scholar] [CrossRef]
- Verma, V.K.; Ranjan, M.; Mishra, P. Text mining and information professionals role, issues and challenges. In Proceedings of the 4th International Symposium on Emerging Trends and Technologies in Libraries and Information Services (Ettlis), Noida, India, 6–8 January 2015; pp. 133–137. [Google Scholar]
- Sirsat, S.R.; Chavan, V.; Deshpande, S.P. Mining knowledge from text repositories using information extraction: A review. Sadhana 2014, 39, 53–62. [Google Scholar] [CrossRef]
- Abu, A.; Susan, L.L.H.; Sidhu, A.S.; Dhillon, S.K. Semantic representation of monogenean haptoral bar image annotation. BMC Bioinform. 2013, 14, 48. [Google Scholar] [CrossRef] [PubMed]
- Kuo, C.L.; Hong, J.H. Interoperable cross-domain semantic and geospatial framework for automatic change detection. Comput. Geosci. 2016, 86, 109–119. [Google Scholar] [CrossRef]
- Stock, K.; Stojanovic, T.; Reitsma, F.; Ou, Y.; Bishr, M.; Ortmann, J.; Robertson, A. To ontologise or not to ontologise: An information model for a geospatial knowledge infrastructure. Comput. Geosci. 2012, 45, 98–108. [Google Scholar] [CrossRef]
- Cruz, S.A.B.; Monteiro, A.M.V.; Santos, R. Automated geospatial web services composition based on geodata quality requirements. Comput. Geosci. 2012, 47, 60–74. [Google Scholar] [CrossRef]
- Li, W.; Yang, C.; Nebert, D.; Raskin, R.; Houser, P.; Wu, H.; Li, Z. Semantic-based web service discovery and chaining for building an arctic spatial data infrastructure. Comput. Geosci. 2011, 37, 1752–1762. [Google Scholar] [CrossRef]
- Jung, C.-T.; Sun, C.-H.; Yuan, M. An ontology-enabled framework for a geospatial problem-solving environment. Comput. Environ. Urban Syst. 2013, 38, 45–57. [Google Scholar] [CrossRef]
- Xiao, C.; Chen, N.; Wang, X.; Chen, Z. A semantic registry method using sensor metadata ontology to manage heterogeneous sensor information in the geospatial sensor web. ISPRS Int. J. Geo-Inf. 2016, 5, 63. [Google Scholar] [CrossRef]
- Hu, Y.; Janowicz, K.; Prasad, S.; Gao, S. Enabling semantic search and knowledge discovery for arcgis online: A linked-data-driven. In Proceedings of the 18th AGILE International Conference on Geographic Information Science, AGILE 2015, Lisbon, Portugal, 9–12 June 2015; Kluwer Academic Publishers: Lisbon, Portugal, 2015; pp. 107–124. [Google Scholar]
- Ganesan, V.; Waheeta, H.S.; Srimathi, H. Jena with sparql to find indian natural plants used as medicine for diseases. In Proceedings of the International Conference on Internet Computing and Information Communications, Chennai, India, 12–14 February 2012; Sathiakumar, S., Awasthi, L.K., Masillamani, M.R., Sridhar, S.S., Eds.; Springer: Berlin, Germany, 2014; pp. 225–237. [Google Scholar]
- Alves, M.B.; Damasio, C.V.; Correia, N. Sparql commands in jena rules. In Proceedings of the 6th International Conference Knowledge Engineering and Semantic Web, KESW 2015, Moscow, Russia, 30 September–2 October 2015; Klinov, P., Mouromtsev, D., Eds.; Springer: Moscow, Russia, 2015; pp. 253–262. [Google Scholar]
- Thangsupachai, N.; Niwattanakul, S.; Chamnongsri, N. Learning object metadata mapping for linked open data. In Emergence of Digital Libraries—Research and Practices; Tuamsuk, K., Jatowt, A., Rasmussen, E., Eds.; Springer International Publishing: Cham, Switzerland, 2014; Volume 8839, pp. 122–129. [Google Scholar]
- Chebotko, A.; Lu, S.; Fei, X.; Fotouhi, F. Rdfprov: A relational rdf store for querying and managing scientific workflow provenance. Data Knowl. Eng. 2010, 69, 836–865. [Google Scholar] [CrossRef]
- Giordano, D.; Maiorana, F. Learning about the semantic web in an information systems oriented curriculum: A case study. In Computer Supported Education; Zvacek, S., Restivo, M.T., Uhomoibhi, J., Helfert, M., Eds.; Springer International Publishing: Cham, Switzerland, 2015; Volume 510, pp. 242–257. [Google Scholar]
- Jang, B.; Ha, Y.-G. Transitivity reasoning for rdf ontology with iterative mapreduce. In Proceedings of the Seventh International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing, Taichung, Taiwan, 3–5 July 2013; IEEE: Taichung, Taiwan, 2013; pp. 232–237. [Google Scholar]
- Liu, S.; Zhang, F.; Zhai, Z. Modeling and discovering data services over sparql services. In Proceedings of the IEEE World Congress on Services, Anchorage, AK, USA, 27 June–2 July 2014; Zhang, L.J., Bahsoon, R., Eds.; IEEE: Anchorage, AK, USA, 2014; pp. 169–173. [Google Scholar]
- Jing, Y.; Jeong, D.; Baik, D.-K. Sparql graph pattern rewriting for owl-dl inference queries. Knowl. Inf. Syst. 2009, 20, 243–262. [Google Scholar] [CrossRef]
- Song Wan, L.; Ni Li, X. Semantic query and reasoning system based on domain ontology. In Proceedings of the 2015 International Symposium on Computers & Informatics, Beijing, China, 17–18 January 2015; Liang, H., Wang, W., Eds.; Atlantis Press: Amsterdam, The Netherlands, 2015; pp. 2524–2531. [Google Scholar]
- Christodoulou, G.; Petrakis, E.G.M.; Batsakis, S. Qualitative spatial reasoning using topological and directional information in owl. In Proceedings of the IEEE 24th International Conference on Tools with Artificial Intelligence, Athens, Greece, 7–9 November 2012; IEEE: Athens, Greece, 2012; pp. 596–602. [Google Scholar]
- Herrero-Zazo, M.; Segura-Bedmar, I.; Hastings, J.; Martinez, P. Dinto: Using owl ontologies and swrl rules to infer drug-drug interactions and their mechanisms. J. Chem. Inf. Model. 2015, 55, 1698–1707. [Google Scholar] [CrossRef] [PubMed]
- Orlando, J.P.; Musen, M.A.; Moreira, D.A. User extensible system to identify problems in owl ontologies and swrl rules. In Rule Technologies: Foundations, Tools, and Applications; Bassiliades, N., Gottlob, G., Sadri, F., Paschke, A., Roman, D., Eds.; Springer International Publishing: Cham, Switzerland, 2015; Volume 9202, pp. 112–126. [Google Scholar]
- Cantone, D.; Longo, C.; Nicolosi-Asmundo, M.; Santamaria, D.F. Web ontology representation and reasoning via fragments of set theory. In Web Reasoning and Rule Systems; TenCate, B., Mileo, A., Eds.; Springer International Publishing: Cham, Switzerland, 2015; Volume 9209, pp. 61–76. [Google Scholar]
- Wu, L.; Xue, L.; Li, C.; Lv, X.; Chen, Z.; Guo, M.; Xie, Z. A geospatial information grid framework for geological survey. PLoS ONE 2015, 10, e0145312. [Google Scholar] [CrossRef] [PubMed]
- White, T. Meet hadoop. In Hadoop—The Definitive Guide; Tsinghua University Press: Beijing, China, 2010; pp. 1–11. [Google Scholar]
- White, T. The hadoop distributed filesystem. In Hadoop—The Definitive Guide; Tsinghua University Press: Beijing, China, 2010; pp. 44–79. [Google Scholar]
- White, T. Zookeeper. In Hadoop—The Definitive Guide; Tsinghua University Press: Beijing, China, 2010; pp. 394–430. [Google Scholar]
- George, L. Introduction. In Hbase—The Definitive Guide; POST & TELECOM PRESS: Beijing, China, 2013; pp. 1–26. [Google Scholar]
- George, L. Advanced usage. In Hbase—The Definitive Guide; POST & TELECOM PRESS: Beijing, China, 2013; pp. 339–365. [Google Scholar]
- Gruber, T.R. A translation approach to portable ontology specifications. Knowl. Acquis. 1993, 5, 199–220. [Google Scholar] [CrossRef]
- Neches, R.; Fikes, R.E.; Finin, T.; Gruber, T.; Patil, R.; Senator, T.; Swartout, W.R. Enabling technology for knowledge sharing. AI Mag. 1991, 12, 36. [Google Scholar]
- Giaretta, P.; Guarino, N. Ontologies and knowledge bases towards a terminological clarification. In Towards Very Large Knowledge Bases: Knowledge Building & Knowledge Sharing; Mars, N., Ed.; IOS Press: Amsterdam, The Netherlands, 1995; Volume 25, p. 32. [Google Scholar]
- An, Y.; Zhao, B. Geo Ontology Design and Comparison in Geographic Information Integration; IEEE Computer Society: Washington, DC, USA, 2007; pp. 608–612. [Google Scholar]
- Zhong, J.; Aydina, A.; McGuinness, D.L. Ontology of fractures. J. Struct. Geol. 2009, 31, 251–259. [Google Scholar] [CrossRef]
- Li, C.; Song, M.; Lv, X.; Luo, X.; Li, J. The spatial data sharing mechanisms of geological survey information grid in p2p mixed network systems network architecture model. In Proceedings of the 2010 9th International Conference on Grid and Cooperative Computing (GCC), Nanjing, China, 1–5 November 2010; pp. 258–263. [Google Scholar]
- Li, C. Geological domain ontology and its application. In China Geological Survey Information Grid—Technology & Methodology; Geological Publishing House: Beijing, China, 2013; pp. 27–51. [Google Scholar]
- Li, C. The technical infrastructure of geological survey information grid. In Proceedings of the 2010 18th International Conference on Geoinformatics, Beijing, China, 18–20 June 2010. [Google Scholar]
Document Name | Content Description | Count | Data Type | Data Features |
---|---|---|---|---|
Achievement reports | Regional geological survey theme reports (origin, evolution, working methods, etc.), results summaries | 4 | a doc | Unstructured |
Achievement Illustrations | Geological maps, mineral maps, environmental geological maps | 3 | b gis | Structured |
Acceptance Documentation | Final evaluation result reports, mid-term evaluation result reports, wild acceptance result reports | 9 | doc | Unstructured |
Field Book | Wild route record, measured profile record | 78 | c xml, d jpg, doc, gis | Unstructured |
Editorial Images | Field draft, comprehensive draft, factual material, primitive maps for compilation | 31 | jpg, gis | Unstructured |
Image and Interpretation | Remote sensing imagery, aerial survey interpretations | 11 | gis, e tiff, jpg | Unstructured |
Geological Section | Geological section tables, stratum section column | 54 | gis, doc, f xls | Unstructured |
Specimens | Ore mineral, spectrum, silicate, tombarthite | 35 | doc, gis | Structured |
Quality Check | Geological data quality check card (geotraverse, section, etc.), raw material inspection records, raw material inspections | 22 | doc, xls | Structured |
Measurement Report | Rock authentication reports, fossil authentication reports, tombarthite analysis report | 21 | doc, xls | Unstructured, Structured |
Photos | Geological pictures | 5 | jpg | Unstructured |
Designing Files | Overall design, geological mineral draft, project design, compilation note | 7 | doc, gis | Unstructured, Structured |
Table name: Archive_Info | Row Key: MD5 of Geological Archive | ||
Column family0: basic_info | Column_key | Column name | |
arch_id | Archive id | ||
create_time | Upload time | ||
authority | Access authority | ||
user | User name | ||
size | File size | ||
name | Archive name | ||
title_ml_a [FilePath] | [Document title] |
Table name: Basic_Content | Row Key: MD5 of document + offset of fragment | ||
Column family0: feature_info | Column_key | Column name | |
frag_content | Fragment content | ||
theme_features | Thematic feature concepts | ||
time_features | Temporal feature concepts | ||
coordinate_info | Coordinates | ||
geo_name | Toponymy | ||
map_doc_a [FilePath] | Source file of map document | ||
breviary_img_[FilePath] | [Thumbnail] | ||
original_img_[FilePath] | [Image original file] |
Group | Item | Method | a R | b H | c N | d S | e Precision | f Recall |
---|---|---|---|---|---|---|---|---|
1 | Volcanic rocks | Lucene | 51 | 39 | 48 | 100 | 76.47% | 81.25% |
2 | Semantic | 67 | 46 | 48 | 100 | 68.66% | 95.83% | |
1 | Metamorphic rocks | Lucene | 42 | 26 | 33 | 100 | 61.90% | 78.77% |
2 | Semantic | 40 | 29 | 33 | 100 | 72.50% | 87.88% | |
1 | Arjin | Lucene | 59 | 43 | 57 | 100 | 72.88% | 75.44% |
2 | Semantic | 74 | 56 | 57 | 100 | 75.68% | 98.25% |
© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wu, L.; Xue, L.; Li, C.; Lv, X.; Chen, Z.; Jiang, B.; Guo, M.; Xie, Z. A Knowledge-Driven Geospatially Enabled Framework for Geological Big Data. ISPRS Int. J. Geo-Inf. 2017, 6, 166. https://doi.org/10.3390/ijgi6060166
Wu L, Xue L, Li C, Lv X, Chen Z, Jiang B, Guo M, Xie Z. A Knowledge-Driven Geospatially Enabled Framework for Geological Big Data. ISPRS International Journal of Geo-Information. 2017; 6(6):166. https://doi.org/10.3390/ijgi6060166
Chicago/Turabian StyleWu, Liang, Lei Xue, Chaoling Li, Xia Lv, Zhanlong Chen, Baode Jiang, Mingqiang Guo, and Zhong Xie. 2017. "A Knowledge-Driven Geospatially Enabled Framework for Geological Big Data" ISPRS International Journal of Geo-Information 6, no. 6: 166. https://doi.org/10.3390/ijgi6060166
APA StyleWu, L., Xue, L., Li, C., Lv, X., Chen, Z., Jiang, B., Guo, M., & Xie, Z. (2017). A Knowledge-Driven Geospatially Enabled Framework for Geological Big Data. ISPRS International Journal of Geo-Information, 6(6), 166. https://doi.org/10.3390/ijgi6060166