skip to main content
10.1145/2818427.2818442acmotherconferencesArticle/Chapter ViewAbstractPublication Pagessiggraph-asiaConference Proceedingsconference-collections
other

An efficient hybrid ray tracing and rasterizer architecture for mobile GPU

Published: 02 November 2015 Publication History

Abstract

We present a bandwidth- and energy-efficient, hybrid ray tracing and rasterizer architecture for tile-based mobile GPU. In order to successfully commercialize mobile System on Chip (SoC), including ray tracing hardware solution, effective integration with rasterizer based on OpenGL---ES is indispensable for the performance and compatibility reason. Thus, recently, the traditional rasterizer-ray tracing hybrid approach is revisited to achieve this goal. The key factor for hybrid rendering is to reflect the fundamental principle of tile-based rendering on integrating the ray tracing hardware and mobile GPUs. Consequently, we propose a new architecture for hybrid rendering by combining new three features such as extended tile binning unit, tile prefetch, and per-tile power control. Simulation results show that our architecture is a potentially versatile solution for future mobile GPUs in low-energy devices because it provides as much as 31.7% better G-buffer bandwidth utilization and is up to 2.18 times better performance per unit energy compared to the ray tracing hardware-only solution.

References

[1]
ARM, 2014. EXT_shader_pixel_local_storage. https://www.khronos.org/registry/gles/extensions/EXT/EXT_shader_pixel_local_storage.txt.
[2]
ARM, 2015. ARM flagship mobile GPU, Mali-T760. http://www.arm.com/products/multimedia/mali-performance-efficientgraphics/mali-t760.php.
[3]
Boulos, S., Edwards, D., Lacewell, J. D., Kniss, J., Kautz, J., Shirley, P., and Wald, I. 2007. Packet-based whitted and distribution ray tracing. In Proceedings of Graphics Interface (GI) 2007, 177--184.
[4]
CompuBenchCL, 2015. CompuBench CL. https://compubench.com.
[5]
Deering, M., Winner, S., Schediwy, B., Duffy, C., and Hunt, N. 1988. The triangle processor and normal vector shader: a VLSI system for high performance graphics. In Proceedings of ACM SIGGRAPH 1988, 21--30.
[6]
Ernst, M., and Greiner, G. 2008. Multi bounding volume hierarchies. In Proceedings of IEEE Symposium on Interactive Ray Tracing (IRT) 2008, 35--40.
[7]
Fujita, M., and Harada, T. 2014. Foveated real-time ray tracing for virtual reality headset. Tech. rep., Light Transport Entertainment Research.
[8]
Garanzha, K., and Loop, C. 2010. Fast ray sorting and breadth-first packet traversal for GPU ray tracing. Computer Graphics Forum (Proceedings of EUROGRAPHICS 2010) 29, 2, 289--298.
[9]
Horowitz, M., Indermaur, T., and Gonzalez, R. 1994. Low-power digital design. In Proceedings of IEEE Symposium on Low Power Electronics and Design. (ISLPE94) Digest of Techical Papers, 8--11.
[10]
Kopta, D., Shkurko, K., Spjut, J., Brunvand, E., and Davis, A. 2013. An energy and bandwidth efficient ray tracing architecture. In Proceedings of ACM High Performance Graphics (HPG) 2013, 121--128.
[11]
Lee, W.-J., Woo, S.-O., Kwon, K.-T., Son, S.-J., Min, K.-J., Lee, C.-H., Jang, K.-J., Park, C.-M., Jung, S.-Y., and Lee, S.-H. 2011. A scalable GPU architecture based on dynamically embedded reconfigurable processor. In Proceedings of ACM High Performance Graphics 2011, Posters.
[12]
Lee, W.-J., Shin, Y., Lee, J., Kim, J.-W., Nah, J.-H., Jung, S.-Y., Lee, S.-H., and Han, H.-S. P. T.-D. 2013. SGRT: A mobile GPU architecture for real-time ray tracing. In Proceedings of ACM High Performance Graphics (HPG) 2013, 109--119.
[13]
Lee, J., Lee, W.-J., Shin, Y., Hwang, S. J., Ryu, S., and Kim, J. 2014. Two-AABB traversal for mobile real-time ray tracing. In Proceedings of ACM SIGGRAPH Asia 2014, Symposium on Mobile Graphics and Interactive Applications (MGIA), Article No. 14.
[14]
Lee, W.-J., Shin, Y., Lee, J., Hwang, S. J., Ryu, S., and Kim, J. 2014. An energy efficient hardware multithreading scheme for mobile ray tracing. In Proceedings of ACM SIGGRAPH Asia 2014, Symposium on Mobile Graphics and Interactive Applications (MGIA), Article No. 1.
[15]
McCombe, J. 2014. New techniques made possible by PowerVR ray tracing hardware. In Game Developer Conferecne (GDC) 2014, Technical Talk.
[16]
Muralimanohar, N., Balasubramonian, R., and Jouppi, N. 2007. Optimizing NUCA organizations and wiring alternatives for large caches with CACTI 6.0. In Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, 3--14.
[17]
Nah, J.-H., Kwon, H.-J., Kim, D.-S., Jeong, C.-H., Park, J., Han, T.-D., Manocha, D., and Park, W.-C. 2014. RayCore: A ray-tracing hardware architecture for mobile devices. ACM Transactions on Graphics (TOG) 33, 6, Article No. 162.
[18]
Parker, S. G., Bigler, J., Dietrich, A., Friedrich, H., Hoberock, J., Luebke, D., McAllister, D., McGuire, M., Morley, K., Robison, A., and Stich, M. 2010. OptiX: a general purpose ray tracing engine. ACM Transactions on Graphics (Proceedings of ACM SIGGRAPH 2010) 66, 4, Article No. 66.
[19]
Qualcomm, 2015. Qualcomm flagship mobile GPU, adreno 430. https://www.qualcomm.com/products/snapdragon/gpu.
[20]
Schmittler, J., Woop, S., Wagner, D., Paul, W. J., and Slusallek, P. 2004. Realtime ray tracing of dynamic scenes on an FPGA chip. In Proceedings of ACM SIGGRAPH/EUROGRAPHICS Graphics Hardware, 95--106.
[21]
Synopsis, 2015. PrimeTime PX: Power analysis tool. http://www.synopsys.com/Tools/Implementation/SignOff/Pages/PrimeTime.aspx.
[22]
Tsakok, J. A. 2009. Faster incoherent rays: Multi-BVH ray stream tracing. In Proceedings of ACM High Performance Graphics (HPG) 2009, 151--158.
[23]
Wald, I., Slusallek, P., Benthin, C., and Wagner, M. 2001. Interactive rendering with coherent ray tracing. Computer Graphics Forum 20, 3, 153--165.
[24]
Wald, I., Woop, S., Benthin, C., Johnson, G. S., and Ernst, M. 2014. Embree: A kernel framework for efficient CPU ray tracing. ACM Transactions on Graphics (Proceedings of ACM SIGGRAPH 2014) 33, 4, Article No. 143.
[25]
Woop, S., Schmittler, J., and Slusallek, P. 2005. RPU: a programmable ray processing unit for realtime ray tracing. ACM Transactions on Graphics (Proceedings of ACM SIGGRAPH 2005) 24, 3, 434--444.

Cited By

View all

Index Terms

  1. An efficient hybrid ray tracing and rasterizer architecture for mobile GPU

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      SA '15: SIGGRAPH Asia 2015 Mobile Graphics and Interactive Applications
      November 2015
      131 pages
      ISBN:9781450339285
      DOI:10.1145/2818427
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      In-Cooperation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 02 November 2015

      Check for updates

      Author Tags

      1. GPU
      2. mobile
      3. multithreading
      4. ray tracing

      Qualifiers

      • Other

      Conference

      SA'15
      SA'15: SIGGRAPH Asia 2015
      November 2 - 6, 2015
      Kobe, Japan

      Acceptance Rates

      Overall Acceptance Rate 178 of 869 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)10
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 06 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media