research-article

Auto-tuning interactive ray tracing using an analytical GPU architecture model

Authors:

Per Ganestam,

Michael DoggettAuthors Info & Claims

GPGPU-5: Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units

Pages 94 - 100

https://doi.org/10.1145/2159430.2159441

Published: 03 March 2012 Publication History

Get Access

Abstract

This paper presents a method for auto-tuning interactive ray tracing on GPUs using a hardware model. Getting full performance from modern GPUs is a challenging task. Workloads which require a guaranteed performance over several runs must select parameters for the worst performance of all runs. Our method uses an analytical GPU performance model to predict the current frame's rendering time using a selected set of parameters. These parameters are then optimised for a selected frame rate performance on the particular GPU architecture. We use auto-tuning to determine parameters such as phong shading, shadow rays and the number of ambient occlusion rays. We sample a priori information about the current rendering load to estimate the frame workload. A GPU model is run iteratively using this information to tune rendering parameters for a target frame rate. We use the OpenCL API allowing tuning across different GPU architectures. Our auto-tuning enables the rendering of each frame to execute in a predicted time, so a target frame rate can be achieved even with widely varying scene complexities. Using this method we can select optimal parameters for the current execution taking into account the current viewpoint and scene, achieving performance improvements over predetermined parameters.

References

[1]

T. Aila and S. Laine. Understanding the Efficiency of Ray Traversal on GPUs. In Proc. High-Performance Graphics 2009, pages 145--149, 2009.

Digital Library

Google Scholar

[2]

S. S. Baghsorkhi, M. Delahaye, S. J. Patel, W. D. Gropp, and W.-m. W. Hwu. An adaptive performance modeling tool for gpu architectures. SIGPLAN Not., 45:105--114, January 2010.

Digital Library

Google Scholar

[3]

A. Bakhoda, G. L. Yuan, W. W. L. Fung, H. Wong, and T. M. Aamodt. Analyzing cuda workloads using a detailed gpu simulator. In International Symposium on Performance Analysis of Systems and Software (ISPASS), 2009.

Crossref

Google Scholar

[4]

W. Dally. Power Efficient Supercomputing. Accelerator-based Computing and Manycore Workshop (presentation), 2009.

Google Scholar

[5]

A. Davidson, Y. Zhang, and J. D. Owens. An auto-tuned method for solving large tridiagonal systems on the GPU. In Proceedings of the 25th IEEE International Parallel and Distributed Processing Symposium, May 2011.

Digital Library

Google Scholar

[6]

T. A. Funkhouser and C. H. Séquin. Adaptive display algorithm for interactive frame rates during visualization of complex virtual environments. In Proceedings ACM SIGGRAPH, pages 247--254, 1993.

Digital Library

Google Scholar

[7]

S. Hong and H. Kim. An analytical model for a gpu architecture with memory-level and thread-level parallelism awareness. SIGARCH Comput. Archit. News, 37(3):152--163, 2009.

Digital Library

Google Scholar

[8]

J. D. Owens, M. Houston, D. Luebke, S. Green, J. E. Stone, and J. C. Phillips. GPU computing. Proceedings of the IEEE, 96(5):879--899, May 2008.

Crossref

Google Scholar

[9]

S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. mei W. Hwu. Optimization Principles and Application Performance Evaluation of a Multithreaded GPU Using CUDA. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pages 73--82, 2008.

Digital Library

Google Scholar

[10]

H. Wong, M.-M. Papadopoulou, M. Sadooghi-Alvandi, and A. Moshovos. Demystifying gpu microarchitecture through microbenchmarking. In IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2010.

Crossref

Google Scholar

Cited By

View all

Grigoryan DChou YAamodt T(2024)Zatel: Sample Complexity-Aware Scale-Model Simulation for Ray Tracing2024 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)10.1109/ISPASS61541.2024.00024(156-166)Online publication date: 5-May-2024
https://doi.org/10.1109/ISPASS61541.2024.00024
Herveau KPfaffe PTillmann MTichy WDachsbacher C(2023)Analysis of Acceleration Structure Parameters and Hybrid Autotuning for Ray TracingIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2021.311349929:2(1345-1356)Online publication date: 1-Feb-2023
https://doi.org/10.1109/TVCG.2021.3113499
Arndt ORallapalli PBlume H(2019)Portable implementations for heterogeneous hardware platforms in autonomous driving systemsBig Data Analytics for Cyber-Physical Systems10.1016/B978-0-12-816637-6.00006-3(113-143)Online publication date: 2019
https://doi.org/10.1016/B978-0-12-816637-6.00006-3
Show More Cited By

Recommendations

GPU ray tracing

The NVIDIA® OptiX™ ray tracing engine is a programmable system designed for NVIDIA GPUs and other highly parallel architectures. The OptiX engine builds on the key observation that most ray tracing algorithms can be implemented using a small set of ...
Ray tracing dynamic scenes with shadows on GPU
EG PGV'10: Proceedings of the 10th Eurographics conference on Parallel Graphics and Visualization

We present fast ray tracing of dynamic scenes in this paper with primary and shadow rays. We present a GPUfriendly strategy to bring coherency to shadow rays, based on previous work on grids as acceleration structures. We introduce indirect mapping of ...
Ray tracing via GPU rasterization

Ray tracing is a dominant method for generating a wide variety of global illumination effects, such as reflections/refractions, shadows, etc. In this paper, we propose an efficient technique to perform nearly accurate ray tracing using the programmable ...

Comments

Information & Contributors

Information

Published In

GPGPU-5: Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units

March 2012

122 pages

ISBN:9781450312332

DOI:10.1145/2159430

Editors:
David Kaeli
Northeastern University, Boston, MA
,
John Cavazos
University of Delaware, Newark, DE
,
Enqiang Sun

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 March 2012

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article

Conference

GPGPU-5

Sponsor:

GPGPU-5: The 5th Annual Workshop on General Purpose Processing with Graphics Processing Units

March 3, 2012

London, United Kingdom

Acceptance Rates

Overall Acceptance Rate 57 of 129 submissions, 44%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

6
Total Citations
View Citations
259
Total Downloads

Downloads (Last 12 months)12
Downloads (Last 6 weeks)1

Reflects downloads up to 06 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Grigoryan DChou YAamodt T(2024)Zatel: Sample Complexity-Aware Scale-Model Simulation for Ray Tracing2024 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)10.1109/ISPASS61541.2024.00024(156-166)Online publication date: 5-May-2024
https://doi.org/10.1109/ISPASS61541.2024.00024
Herveau KPfaffe PTillmann MTichy WDachsbacher C(2023)Analysis of Acceleration Structure Parameters and Hybrid Autotuning for Ray TracingIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2021.311349929:2(1345-1356)Online publication date: 1-Feb-2023
https://doi.org/10.1109/TVCG.2021.3113499
Arndt ORallapalli PBlume H(2019)Portable implementations for heterogeneous hardware platforms in autonomous driving systemsBig Data Analytics for Cyber-Physical Systems10.1016/B978-0-12-816637-6.00006-3(113-143)Online publication date: 2019
https://doi.org/10.1016/B978-0-12-816637-6.00006-3
Tillmann MPfaffe PKaag CTichy W(2016)Online-Autotuning of Parallel SAH kD-Trees2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2016.31(628-637)Online publication date: May-2016
https://doi.org/10.1109/IPDPS.2016.31
Lopez-Novoa UMendiburu AMiguel-Alonso J(2015)A Survey of Performance Modeling and Simulation Techniques for Accelerator-Based ComputingIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2014.230821626:1(272-281)Online publication date: Jan-2015
https://doi.org/10.1109/TPDS.2014.2308216
Weber NGoesele MDachsbacher C(2014)Auto-tuning complex array layouts for GPUsProceedings of the 14th Eurographics Symposium on Parallel Graphics and Visualization10.5555/2855568.2855580(57-64)Online publication date: 9-Jun-2014
https://dl.acm.org/doi/10.5555/2855568.2855580

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Recommendations

GPU ray tracing

Ray tracing dynamic scenes with shadows on GPU

Ray tracing via GPU rasterization