skip to main content
10.1145/2771937.2771944acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

By their fruits shall ye know them: A Data Analyst's Perspective on Massively Parallel System Design

Published: 31 May 2015 Publication History

Abstract

Increasingly parallel systems promise a remedy for the current stagnation of single-core performance. However, the battle to find the most appropriate architecture for the resulting massively parallel systems is still ongoing. Currently, there are two active contenders: Massively Parallel Single Instruction Multiple Threads (SIMT) systems such as GPGPUs and Many Core Single Instruction Multiple Data (SIMD) systems such as Intel's Xeon Phi. While the former is more versatile, the latter is an efficient, time-tested technology with a clear migration path. In this study, we provide a data management perspective to the debate: we study the implementation and performance of a set of common data management operations on an SIMT device (an Nvidia GTX 780) and compare it to a Many Core SIMD system (an Intel Xeon Phi). We interpret the results to pinpoint architectural decisions and tradeoffs that lead to suboptimal performance and point out potential areas for improvement in the next generation of these devices.

References

[1]
Balkesen, C., Teubner, J., Alonso, G., and Ozsu, M. T. Main-memory hash joins on multi-core cpus: Tuning to the underlying hardware. In Data Engineering (ICDE), 2013 IEEE 29th International Conference on (2013), IEEE, pp. 362--373.
[2]
Batcher, K. E. Sorting networks and their applications. In AFIPS Conference Proceedings (1968), pp. 307--314.
[3]
Boncz, P., Neumann, T., and Erling, O. Tpc-h analyzed: Hidden messages and lessons learned from an influential benchmark. In Performance Characterization and Benchmarking. Springer, 2014, pp. 61--76.
[4]
Fang, J., Sips, H., Zhang, L., Xu, C., Che, Y., and Varbanescu, A. L. Test-driving intel xeon phi. In Proceedings of the 5th ACM/SPEC international conference on Performance engineering (2014), ACM, pp. 137--148.
[5]
Fernando, R., Haines, E., and Sweeney, T. GPU Gems: Programming Techniques, Tips and Tricks for Real-Time Graphics. Addision-Wesley, 2004.
[6]
Govindaraju, N. K., and Manocha, D. Efficient relational database management using graphics processors. In Proceedings of the 1st international workshop on Data management on new hardware (2005), ACM, p. 1.
[7]
He, B., Lu, M., Yang, K., Fang, R., Govindaraju, N. K., Luo, Q., and Sander, P. V. Relational query coprocessing on graphics processors. ACM Transactions on Database Systems (TODS) 34, 4 (2009), 21.
[8]
Jha, S., He, B., Lu, M., Cheng, X., and Huynh, H. P. Improving main memory hash joins on intel xeon phi processors: An experimental approach. Proc. VLDB Endow. 8, 6 (Feb. 2015), 642--653.
[9]
Kim, C., Kaldewey, T., Lee, V. W., Sedlar, E., Nguyen, A. D., Satish, N., Chhugani, J., Di Blas, A., and Dubey, P. Sort vs. hash revisited: Fast join implementation on modern multi-core cpus. Proceedings of the VLDB Endowment 2, 2 (2009), 1378--1389.
[10]
Ladner, R. E., and Fischer, M. J. Parallel prefix computation. Journal of the ACM (JACM) 27, 4 (1980), 831--838.
[11]
Nassimi, D., and Sahni, S. Data broadcasting in simd computers. Computers, IEEE Transactions on 100, 2 (1981), 101--107.
[12]
Nguyen, H. GPU Gems 3. Addison-Wesley Professional, 2007.
[13]
Pharr, M., and Fernando, R. GPU Gems 2: Programming Techniques for High-Performance Graphics and General-Purpose Computation. Addison-Wesley Professional, 2005.
[14]
Pirk, H., Funke, F., Grund, M., Neumann, T., Leser, U., Manegold, S., Kemper, A., and Kersten, M. Cpu and cache efficient management of memory-resident databases. In Data Engineering (ICDE), 2013 IEEE 29th International Conference on (2013), IEEE, pp. 14--25.
[15]
Pirk, H., Petraki, E., Idreos, S., Manegold, S., and Kersten, M. Database cracking: fancy scan, not poor man's sort! In Proceedings of the Tenth International Workshop on Data Management on New Hardware (2014), ACM, p. 4.
[16]
Reinders, J., and Jeffers, J. High Performance Parallelism Pearls: Multicore and Many-core Programming Approaches. Morgan Kaufmann, 2014.

Cited By

View all

Index Terms

  1. By their fruits shall ye know them: A Data Analyst's Perspective on Massively Parallel System Design

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      DaMoN'15: Proceedings of the 11th International Workshop on Data Management on New Hardware
      May 2015
      100 pages
      ISBN:9781450336383
      DOI:10.1145/2771937
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 31 May 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      SIGMOD/PODS'15
      Sponsor:
      SIGMOD/PODS'15: International Conference on Management of Data
      May 31 - June 4, 2015
      VIC, Melbourne, Australia

      Acceptance Rates

      DaMoN'15 Paper Acceptance Rate 12 of 16 submissions, 75%;
      Overall Acceptance Rate 94 of 127 submissions, 74%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)5
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 24 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media