skip to main content
research-article
Open access

Mesa: a geo-replicated online data warehouse for Google's advertising system

Published: 24 June 2016 Publication History

Abstract

Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google's Internet advertising business. Mesa is designed to satisfy a complex and challenging set of user and systems requirements, including near real-time data ingestion and retrieval, as well as high availability, reliability, fault tolerance, and scalability for large data and query volumes. Specifically, Mesa handles petabytes of data, processes millions of row updates per second, and serves billions of queries that fetch trillions of rows per day. Mesa is geo-replicated across multiple datacenters and provides consistent and repeatable query answers at low latency, even when an entire datacenter fails. This paper presents the Mesa system and reports the performance and scale that it achieves.

References

[1]
Abouzeid, A., Bajda-Pawlikowski, K., et al. HadoopDB: An architectural hybrid of MapReduce and DBMS technologies for analytical workloads. PVLDB 2, 1 (2009), 922--933.
[2]
Baker, J., Bond, C., et al. Megastore: Providing scalable, highly available storage for interactive services. In CIDR (2011). 223--234.
[3]
Chang, F., Dean, J., et al. Bigtable: A distributed storage system for structured data. In OSDI (2006). 205--218.
[4]
Cohen, J., Eshleman, J., et al. Online expansion of largescale data warehouses. PVLDB 4, 12 (2011), 1249--1259.
[5]
Corbett, J.C., Dean, J., et al. Spanner: Google's globally-distributed database. In OSDI (2012). 251--264.
[6]
Dean, J., Ghemawat, S. MapReduce: Simplified data processing on large clusters. Commun. ACM 51, 1 (2008), 107--113.
[7]
Fikes, A. Storage architecture and challenges. http://goo.gl/pF6kmz, 2010.
[8]
Glendenning, L., Beschastnikh, I., et al. Scalable consistency in scatter. In SOSP (2011). 15--28.
[9]
Gupta, A., Yang, F., et al. Mesa: Geo-replicated, near real-time, scalable data warehousing. In VLDB (2014).
[10]
Lamb, A., Fuller, M., et al. The Vertica analytic database: C-Store 7 years later. PVLDB 5, 12 (2012), 1790--1801.
[11]
Lamport, L. The part-time parliament. ACM Trans. Comput. Syst. 16, 2 (1998), 133--169.
[12]
Lee, G., Lin, J., et al. The unified logging infrastructure for data analytics at Twitter. PVLDB 5, 12 (2012), 1771--1780.
[13]
Project Voldemort: A Distributed Database. http://www.project-voldemort.com/voldemort/.
[14]
SAP HANA. http://www.saphana.com/welcome.
[15]
Shute, J., Vingralek, R., et al. F1: A distributed SQL database that scales. PVLDB 6, 11 (2013), 1068--1079.
[16]
Thusoo, A., Shao, Z., et al. Data warehousing and analytics infrastructure at Facebook. In SIGMOD (2010). 1013--1020.
[17]
Weiss, R. A technical overview of the oracle exadata database machine and exadata storage server. Oracle White Paper. Oracle Corporation, Redwood Shores, 2012.

Cited By

View all

Index Terms

  1. Mesa: a geo-replicated online data warehouse for Google's advertising system

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Communications of the ACM
      Communications of the ACM  Volume 59, Issue 7
      July 2016
      118 pages
      ISSN:0001-0782
      EISSN:1557-7317
      DOI:10.1145/2963119
      • Editor:
      • Moshe Y. Vardi
      Issue’s Table of Contents
      This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike International 4.0 License.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 24 June 2016
      Published in CACM Volume 59, Issue 7

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)573
      • Downloads (Last 6 weeks)61
      Reflects downloads up to 05 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Digital Edition

      View this article in digital edition.

      Digital Edition

      Magazine Site

      View this article on the magazine site (external)

      Magazine Site

      Login options

      Full Access

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media