skip to main content
research-article

RTScan: Efficient Scan with Ray Tracing Cores

Published: 03 May 2024 Publication History

Abstract

Indexing is a core technique for accelerating predicate evaluation in databases. After many years of effort, the indexing performance has reached its peak on the existing hardware infrastructure. We propose to use ray tracing (RT) cores to move the indexing performance and efficiency to another level by addressing the following technical challenges: (1) the lack of an efficient mapping of predicate evaluation to a ray tracing job and (2) the poor performance by the heavy and imbalanced ray load when processing skewed datasets. These challenges set obstacles to effectively exploiting RT cores for predicate evaluation.
In this paper, we propose RTScan, an approach that leverages RT cores to accelerate index scans. RTScan transforms the evaluation of conjunctive predicates into an efficient ray tracing job in a three-dimensional space. A set of techniques are designed in RTScan, i.e., Uniform Encoding, Data Sieving, and Matrix RT Refine, which significantly enhances the parallelism of scans on RT cores while lightening and balancing the ray load. With the proposed techniques, RTScan achieves high performance for datasets with either uniform or skewed distributions and queries with different selectivities. Extensive evaluations demonstrate that RTScan enhances the scan performance on RT cores by five orders of magnitude and outperforms the state-of-the-art approach on CPU by up to 4.6×.

References

[1]
Muhammad A Awad, Serban D Porumbescu, and John D Owens. 2022. A GPU Multiversion B-Tree. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques. 481--493.
[2]
Carsten Binnig, Stefan Hildenbrand, and Franz Färber. 2009. Dictionary-based order-preserving string compression for main memory column stores. In Proceedings of the 2009 ACM SIGMOD International Conference on Management of data. 283--296.
[3]
I Evangelou, G Papaioannou, K Vardis, and AA Vasilakis. 2021. Fast radius search exploiting ray-tracing frameworks. Journal of Computer Graphics Techniques Vol 10, 1 (2021).
[4]
Ziqiang Feng, Eric Lo, Ben Kao, and Wenjian Xu. 2015. Byteslice: Pushing the envelop of main memory data processing with a new storage layout. In SIGMOD. ACM, 31--46.
[5]
Vijay K Garg. 2002. Elements of distributed computing. John Wiley & Sons, 150--151.
[6]
Georgios Giannikis, Darko Makreshanski, Gustavo Alonso, and Donald Kossmann. 2014. Shared workload optimization. Proceedings of the VLDB Endowment 7, 6 (2014), 429--440.
[7]
Joseph M. Hellerstein and Michael Stonebraker. 1993. Predicate migration: optimizing queries with expensive predicates. In Proceedings of the 1993 ACM SIGMOD international conference on Management of data - SIGMOD '93.
[8]
Justus Henneberg and Felix Schuhknecht. 2023. RTIndeX: Exploiting Hardware-Accelerated GPU Raytracing for Database Indexing. arXiv preprint arXiv:2303.01139 (2023).
[9]
Brian Hentschel, Michael S Kester, and Stratos Idreos. 2018. Column sketches: A scan accelerator for rapid and robust predicate evaluation. In Proceedings of the 2018 International Conference on Management of Data. 857--872.
[10]
Ryan Johnson, Vijayshankar Raman, Richard Sidle, and Garret Swart. 2008. RowWise Parallel Predicate Evaluation. Proc. VLDB Endow. 1, 1 (aug 2008), 622--634.
[11]
Daniel Jünger, Robin Kobus, André Müller, Christian Hundt, Kai Xu, Weiguo Liu, and Bertil Schmidt. 2020. Warpcore: A library for fast hash tables on gpus. In 2020 IEEE 27th international conference on high performance computing, data, and analytics (HiPC). IEEE, 11--20.
[12]
Fisnik Kastrati and Guido Moerkotte. 2016. Optimization of conjunctive predicates for main memory column stores. Proceedings of the VLDB Endowment 9, 12 (2016), 1125--1136.
[13]
Linwei Li, Kai Zhang, Jiading Guo, Wen He, Zhenying He, Yinan Jing, Weili Han, and X Sean Wang. 2020. Bindex: A two-layered index for fast and robust scans. In Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 909--923.
[14]
Yinan Li and Jignesh M. Patel. 2013. BitWeaving: Fast Scans for Main Memory Data Processing. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data (New York, New York, USA) (SIGMOD '13). Association for Computing Machinery, New York, NY, USA, 289--300.
[15]
Enzo Meneses, Cristóbal A. Navarro, Héctor Ferrada, and Felipe A. Quezada. 2023. Accelerating Range Minimum Queries with Ray Tracing Cores. arXiv:2306.03282 [cs.DC]
[16]
Guido Moerkotte. 1998. Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing. In Proc. VLDB Endow. 476--487.
[17]
Vani Nagarajan and Milind Kulkarni. 2023. RT-DBSCAN: Accelerating DBSCAN using Ray Tracing Hardware. arXiv:2303.09655 [cs.DC]
[18]
Vani Nagarajan, Durga Mandarapu, and Milind Kulkarni. 2023. RT-KNNS Unbound: Using RT Cores to Accelerate Unrestricted Neighbor Search. In Proceedings of the 37th International Conference on Supercomputing (Orlando, FL, USA) (ICS '23). Association for Computing Machinery, New York, NY, USA, 289--300.
[19]
Thomas Neumann, Sven Helmer, and Guido Moerkotte. 2005. On the optimal ordering of maps and selections under factorization. In 21st International Conference on Data Engineering (ICDE'05). IEEE, 490--501.
[20]
NVIDIA. 2018. NVIDIA Turing GPU architecture. (2018), 25--29, 30--32. https://images.nvidia.cn/aem-dam/en-zz/Solutions/design-visualization/technologies/turing-architecture/NVIDIA-Turing-Architecture-Whitepaper.pdf
[21]
Iraklis Psaroudakis, Manos Athanassoulis, and Anastasia Ailamaki. 2013. Sharing Data and Work across Concurrent Analytical Queries. Proc. VLDB Endow. 6, 9 (2013), 637--648.
[22]
Lin Qiao, Vijayshankar Raman, Frederick Reiss, Peter J Haas, and Guy M Lohman. 2008. Main-memory scan sharing for multi-core CPUs. Proceedings of the VLDB Endowment 1, 1 (2008), 610--621.
[23]
Kenneth A Ross. 2002. Conjunctive selection conditions in main memory. In Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. 109--120.
[24]
P Griffiths Selinger, Morton M Astrahan, Donald D Chamberlin, Raymond A Lorie, and Thomas G Price. 1979. Access path selection in a relational database management system. In Proceedings of the 1979 ACM SIGMOD international conference on Management of data. 23--34.
[25]
Peter Shirley, Ingo Wald, Tomas Akenine-Möller, and Eric Haines. 2019. What is a Ray? Apress, Berkeley, CA, 15--19.
[26]
Juliusz Sompolski, Marcin Zukowski, and Peter Boncz. 2011. Vectorization vs. compilation in query execution. In Proceedings of the Seventh International Workshop on Data Management on New Hardware. 33--40.
[27]
Mike Stonebraker, Daniel J Abadi, Adam Batkin, Xuedong Chen, Mitch Cherniack, Miguel Ferreira, Edmond Lau, Amerson Lin, Sam Madden, Elizabeth O'Neil, et al. 2018. C-store: a column-oriented DBMS. In Making Databases Work: the Pragmatic Wisdom of Michael Stonebraker. 491--518.
[28]
Liwen Sun, Michael J Franklin, Sanjay Krishnan, and Reynold S Xin. 2014. Finegrained partitioning for aggressive data skipping. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. 1115--1126.
[29]
Liwen Sun, Michael J Franklin, Jiannan Wang, and Eugene Wu. 2016. Skipping-oriented partitioning for columnar layouts. Proceedings of the VLDB Endowment 10, 4 (2016), 421--432.
[30]
Jianguo Wang, Chunbin Lin, Yannis Papakonstantinou, and Steven Swanson. 2017. An experimental study of bitmap compression vs. inverted list compression. In Proceedings of the 2017 ACM International Conference on Management of Data. 993--1008.
[31]
Zeke Wang, Xue Liu, Kai Zhang, Haihang Zhou, and Bingsheng He. 2019. Understanding and Optimizing Conjunctive Predicates Under Memory-Efficient Storage Layouts. IEEE Transactions on Knowledge and Data Engineering 33, 6 (2019), 2803--2817.
[32]
Yuhao Zhu. 2022. RTNN: Accelerating Neighbor Search Using Hardware Ray Tracing. In Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (Seoul, Republic of Korea) (PPoPP '22). Association for Computing Machinery, New York, NY, USA, 76--89.
[33]
Marcin Zukowski, Mark Van de Wiel, and Peter Boncz. 2012. Vectorwise: A vectorized analytical DBMS. In 2012 IEEE 28th International Conference on Data Engineering. IEEE, 1349--1350.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 17, Issue 6
February 2024
369 pages
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 03 May 2024
Published in PVLDB Volume 17, Issue 6

Check for updates

Badges

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 135
    Total Downloads
  • Downloads (Last 12 months)135
  • Downloads (Last 6 weeks)21
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media