skip to main content
10.1145/775152.775231acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
Article

Piazza: data management infrastructure for semantic web applications

Published: 20 May 2003 Publication History

Abstract

The Semantic Web envisions a World Wide Web in which data is described with rich semantics and applications can pose complex queries. To this point, researchers have defined new languages for specifying meanings for concepts and developed techniques for reasoning about them, using RDF as the data model. To flourish, the Semantic Web needs to be able to accommodate the huge amounts of existing data and the applications operating on them. To achieve this, we are faced with two problems. First, most of the world's data is available not in RDF but in XML; XML and the applications consuming it rely not only on the domain structure of the data, but also on its document structure. Hence, to provide interoperability between such sources, we must map between both their domain structures and their document structures. Second, data management practitioners often prefer to exchange data through local point-to-point data translations, rather than mapping to common mediated schemas or ontologies.This paper describes the Piazza system, which addresses these challenges. Piazza offers a language for mediating between data sources on the Semantic Web, which maps both the domain structure and document structure. Piazza also enables interoperation of XML data with RDF data that is accompanied by rich OWL ontologies. Mappings in Piazza are provided at a local scale between small sets of nodes, and our query answering algorithm is able to chain sets mappings together to obtain relevant data from across the Piazza network. We also describe an implemented scenario in Piazza and the lessons we learned from it.

References

[1]
S. Abiteboul and O. Duschka Complexity of answering queries using materialized views. In PODS '98, pages 254--263, Seattle, WA, 1998.]]
[2]
B. Amann, C. Beeri, I. Fundulaki, and M. Scholl Ontology-based integration of XML web resources. In Int'l Semantic Web Conference '02, pages 117--131, 2002.]]
[3]
M. Arenas, L. E. Bertossi, and J. Chomicki. Consistent query answers in inconsistent databases. PODS'99, pages 68--79, 1999.]]
[4]
T. Berners-Lee, J. Hendler, and O. Lassila. The semantic web. Scientific American, May 2001.]]
[5]
P. A. Bernstein, F. Giunchiglia, A. Kementsietsidis, J. Mylopoulos, L. Serafini, and I. Zaihrayeu. Data management for peer-to-peer computing: A vision. In ACM SIGMOD WebDB Workshop '02, June 2002.]]
[6]
S. Boag, D. Chamberlin, M. F. Fernandez, D. Florescu, J. Robie, J. Simeon, and M. Stefanescu. XQuery 1.0: An XML query language. http://www.w3.org/TR/xquery/, 30 April 2002. W3C working draft.]]
[7]
J. Broekstra, A. Kampan, and F. van Harmelen. Sesame: A generic architecture for storing and querying RDF and RDF Schema. In Int'l Semantic Web Conference '02, pages 54--68, 2002.]]
[8]
S. Cluet, P. Veltri, and D. Vodislav. Views in a large scale XML repository. In VLDB '01, pages 271--280, September 2001.]]
[9]
M. Dean, D. Connolly, F. van Harmelen, J. Hendler, I. Horrocks, D. McGuinness, P. Patel-Schneider, and L. Stein. OWL web ontology language 1.0 reference, 2002. Manuscript available from http://www.w3.org/2001/sw/WebOnt/.]]
[10]
A. Deutsch, M. F. Fernandez, D. Florescu, A. Levy, and D. Suciu. A query language for XML. In Eighth International World Wide Web Conference, 1999.]]
[11]
A. Doan, P. Domingos, and A. Y. Halevy. Reconciling schemas of disparate data sources: A machine-learning approach. In SIGMOD '01, 2001.]]
[12]
A. Doan, J. Madhavan, P. Domingos, and A. Halevy. Learning to map between ontologies on the semantic web. In Proc. of the Int. WWW Conf., 2002.]]
[13]
M. Fernandez, W.-C. Tan, and D. Suciu. SilkRoute: Trading between relations and XML. In Ninth International World Wide Web Conference, November 1999.]]
[14]
A. Halevy, O. Etzioni, A. Doan, Z. Ives, J. Madhavan, L. McDowell, and I. Tatarinov. Crossing the structure chasm. In Proceedings of the First Biennian Conference on Innovative Data Systems Research (CIDR), 2003.]]
[15]
A. Y. Halevy. Answering queries using views: A survey. VLDB Journal, 10(4), 2001.]]
[16]
A. Y. Halevy, Z. G. Ives, D. Suciu, and I. Tatarinov. Schema mediation in peer data management systems. In Proc. of ICDE, 2003.]]
[17]
I. Horrocks, F. van Harmelen, and P. Patel-Schneider. DAML+OIL. http://www.daml.org/2001/03/daml+oil-index.html, March 2001.]]
[18]
Z. Ives, A. Halevy, and D. Weld. An xml query engine for network-bound data. VLDB Journal, Special Issue on XML Query Processing, 2003.]]
[19]
V. Kashyap. The semantic web: Has the db community missed the bus (again)? In Proceedings of the NSF Workshop on DB & IS Research on the Semantic Web and Enterprises, Amicalola, GA, 2002.]]
[20]
D. Lembo, M. Lenzerini, and R. Rosati. Source inconsistency and incompleteness in data integration. In KRDB '02, April 2002.]]
[21]
A. Levy and M.-C. Rousset. Combining Horn rules and description logics in carin. Artificial Intelligence, 104:165--209, September 1998.]]
[22]
A. Y. Levy, A. Rajaraman, and J. J. Ordille. Querying heterogeneous information sources using source descriptions. In Proc. of VLDB, pages 251--262, Bombay, India, 1996.]]
[23]
D. L. McGuinness, R. Fikes, J. Rice, and S. Wilder. The Chimæra ontology environment. In AAAI '00, 2000.]]
[24]
E. Mena, V. Kashyap, A. Illarramendi, and A. P. Sheth. Imprecise answers in distributed environments: Estimation of information loss for multi-ontology based query processing. International Journal of Cooperative Information Systems, 9(4):403--425, 2000.]]
[25]
W. Nejdl, B. Wolf, C. Qu, S. Decker, M. Sintek, A. Naeve, M. Nilsson, M. Palmer, and T. Risch. EDUTELLA: A P2P networking infrastructure based on RDF. In Eleventh International World Wide Web Conference, pages 604--615, 2002.]]
[26]
N. F. Noy and M. A. Musen. PROMPT: Algorithm and tool for ontology merging and alignment. In AAAI '00, 2000.]]
[27]
Y. Papakonstantinou, H. Garcia-Molina, and J. Widom. Object exchange across heterogeneous information sources. In ICDE '95, pages 251--260, 1995.]]
[28]
P. Patel-Schneider and J. Simeon. Building the Semantic Web on XML. In Int'l Semantic Web Conference '02, June 2002.]]
[29]
E. Rahm and P. A. Bernstein. A survey of approaches to automatic schema matching. VLDB Journal, 10(4):334--350, 2001.]]
[30]
D. D. Roure, I. Foster, E. Miller, J. Hendler, and C. Goble. The semantic grid: The grid meets the semantic web. Panel at the WWW Conference, Honolulu, Hawaii, 2002.]]
[31]
M. Rys Bringing the internet to your database: Using SQLServer 2000 and XML to build loosely-coupled systems. In ICDE '01, pages 465--472, 2001.]]
[32]
A. P. Sheth and J. A. Larson. Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Computing Surveys, 22(3):183--236, 1990.]]
[33]
P. Westerman. Data Warehousing: Using the Wal-Mart Model. Morgan Kaufmann Publishers, 2000.]]

Cited By

View all

Index Terms

  1. Piazza: data management infrastructure for semantic web applications

                      Recommendations

                      Comments

                      Information & Contributors

                      Information

                      Published In

                      cover image ACM Conferences
                      WWW '03: Proceedings of the 12th international conference on World Wide Web
                      May 2003
                      772 pages
                      ISBN:1581136803
                      DOI:10.1145/775152
                      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                      Sponsors

                      Publisher

                      Association for Computing Machinery

                      New York, NY, United States

                      Publication History

                      Published: 20 May 2003

                      Permissions

                      Request permissions for this article.

                      Check for updates

                      Author Tags

                      1. XML
                      2. peer data management systems
                      3. semantic web

                      Qualifiers

                      • Article

                      Acceptance Rates

                      Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

                      Contributors

                      Other Metrics

                      Bibliometrics & Citations

                      Bibliometrics

                      Article Metrics

                      • Downloads (Last 12 months)24
                      • Downloads (Last 6 weeks)1
                      Reflects downloads up to 03 Feb 2025

                      Other Metrics

                      Citations

                      Cited By

                      View all

                      View Options

                      Login options

                      View options

                      PDF

                      View or Download as a PDF file.

                      PDF

                      eReader

                      View online with eReader.

                      eReader

                      Figures

                      Tables

                      Media

                      Share

                      Share

                      Share this Publication link

                      Share on social media