Data management projects at Google

Published: 01 March 2008

Abstract

This article describes some of the ongoing research projects related to structured data management at Google today. The organization of Google encourages research scientists to work closely with engineering teams. As a result, the research projects tend to be motivated by real needs faced by Google's products and services, and solutions are put into production and tested rapidly. In addition, because of the sheer scale at which Google operates, the engineering challenges faced by Google's services often require research innovations.

Published In

ACM SIGMOD Record, Volume 37, Issue 1
March 2008
61 pages
ISSN: 0163-5808
DOI: 10.1145/1374780
Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Column

Cited By

  • (2019) Representation Learning on Large and Small Data. Big Data Analytics for Large‐Scale Multimedia Search, pp. 1-28. DOI: 10.1002/9781119376996.ch1. Online publication date: 15-Mar-2019
  • (2018) Continuum. Proceedings of the ACM Symposium on Cloud Computing, pp. 26-40. DOI: 10.1145/3267809.3267817. Online publication date: 11-Oct-2018
  • (2018) Design and achievement of cloud geodatabase for a sponge city (海绵城市云空间数据库设计与实现). Journal of Central South University 25:10, pp. 2423-2437. DOI: 10.1007/s11771-018-3926-1. Online publication date: 12-Nov-2018
  • (2014) The power of choice in data-aware cluster scheduling. Proceedings of the 11th USENIX conference on Operating Systems Design and Implementation, pp. 301-316. DOI: 10.5555/2685048.2685072. Online publication date: 6-Oct-2014
  • (2013) Transaction chains. Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, pp. 276-291. DOI: 10.1145/2517349.2522729. Online publication date: 3-Nov-2013
  • (2013) Benchmarking Apache Accumulo BigData Distributed Table Store Using Its Continuous Test Suite. Proceedings of the 2013 IEEE International Congress on Big Data, pp. 334-341. DOI: 10.1109/BigData.Congress.2013.51. Online publication date: 27-Jun-2013
  • (2013) The ontological key. The VLDB Journal: The International Journal on Very Large Data Bases 22:5, pp. 615-640. DOI: 10.1007/s00778-013-0323-0. Online publication date: 1-Oct-2013
  • (2013) Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles. Online publication date: 3-Nov-2013
  • (2012) Deep Web Query Interface Understanding and Integration. Synthesis Lectures on Data Management 4:4, pp. 1-168. DOI: 10.2200/S00419ED1V01Y201205DTM026. Online publication date: 14-Jun-2012
  • (2012) Cost models for view materialization in the cloud. Proceedings of the 2012 Joint EDBT/ICDT Workshops, pp. 47-54. DOI: 10.1145/2320765.2320788. Online publication date: 30-Mar-2012