skip to main content
10.1145/1966901.1966909acmotherconferencesArticle/Chapter ViewAbstractPublication PageslwdmConference Proceedingsconference-collections
research-article

Exploring the web with OXPath

Published: 25 March 2011 Publication History

Abstract

OXPath is a careful extension of XPath that facilitates data extraction from the deep web. It is designed to facilitate the large-scale extraction of data from sophisticated modern web interfaces with client-side scripting and asynchronous server communication. Its main characteristics are (1) a minimal extension of XPath to allow page navigation and action execution, (2) a set-theoretic formal semantics for full OXPath, (3) and a sophisticated memory management that minimizes page buffering. In this poster, we briefly review the main features of the language and discuss ongoing and future work.

References

[1]
T. Furche, G. Gottlob, G. Grasso, C. Schallhart, and A. Sellers. Finding an OXPath to Cherries Hidden in the Scripted Web. Tech. rep., diadem-project.info/oxpath.
[2]
M. Marx. Conditional XPath. TODS, 2005.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
LWDM '11: Proceedings of the 1st International Workshop on Linked Web Data Management
March 2011
41 pages
ISBN:9781450306089
DOI:10.1145/1966901

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 March 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. AJAX
  2. XPath
  3. web automation
  4. web extraction

Qualifiers

  • Research-article

Funding Sources

Conference

EDBT/ICDT '11

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 105
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media