The BigDAWG Polystore System and Architecture

Gadepally, Vijay; Chen, Peinan; Duggan, Jennie; Elmore, Aaron; Haynes, Brandon; Kepner, Jeremy; Madden, Samuel; Mattson, Tim; Stonebraker, Michael

doi:10.1109/HPEC.2016.7761636

arXiv:1609.07548 (cs)

[Submitted on 24 Sep 2016]

Abstract: Organizations are often faced with the challenge of providing data management solutions for large, heterogeneous datasets that may have different underlying data and programming models. For example, a medical dataset may have unstructured text, relational data, time series waveforms and imagery. Trying to fit such datasets into a single data management system can hurt both performance and efficiency. As a part of the Intel Science and Technology Center on Big Data, we are developing a polystore system designed for such problems. BigDAWG (short for the Big Data Analytics Working Group) is a polystore system designed to work on complex problems that naturally span different processing or storage engines. BigDAWG provides an architecture that supports diverse database systems working with different data models, support for the competing notions of location transparency and semantic completeness via islands, and a middleware that provides a uniform multi-island interface. Initial results from a prototype of the BigDAWG system applied to a medical dataset validate polystore concepts. In this article, we describe polystore databases, the current BigDAWG architecture and its application to the MIMIC II medical dataset, initial performance results, and our future development plans.

Comments:	6 pages, 5 figures, IEEE High Performance Extreme Computing (HPEC) conference 2016
Subjects:	Databases (cs.DB)
Cite as:	arXiv:1609.07548 [cs.DB]
	(or arXiv:1609.07548v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.1609.07548
Related DOI:	https://doi.org/10.1109/HPEC.2016.7761636

Submission history

From: Jeremy Kepner
[v1] Sat, 24 Sep 2016 01:14:06 UTC (514 KB)
