tutorial

A Framework for Multiplatform HPC Applications

Authors:

Shigeru ChibaAuthors Info & Claims

PMAM'14: Proceedings of Programming Models and Applications on Multicores and Manycores

Pages 61 - 69

https://doi.org/10.1145/2578948.2560693

Published: 07 February 2014 Publication History

Abstract

This paper proposes a framework for building multi-platform applications in Java for High Performance Computing (HPC). It allows HPC developers to write their programs in Java but dynamically translate part of the programs into C programs using MPI or CUDA so that the translated code can be executed on multi-platforms. The source of the translated code is written in Java but with extensions for MPI and CUDA supports. The implementations for different platforms are switched by object-oriented mechanisms such as dynamic method dispatch. However, object oriented mechanisms are major sources of execution overheads. To reduce these overheads, the proposed framework requires that the translated code is subject to our coding rules, in which object-oriented mechanisms are available only in limited contexts. All objects except arrays must be immutable and most class types must be leaf classes. Only the types of method parameters and instance fields can be non-leaf class types. These restrictions allow our framework to statically determine object types during the code translation while they still enable building a practical class library for HPC with respect to customizability. This paper presents examples of the class libraries built on top of our framework. Their performance is sometime better than the performance of the programs written in C++ with equivalent class libraries since C++ is a general-purpose language and thus its expressiveness does not perfectly fit our problem domain, HPC applications.

References

[1]

Aparapi. AMD developer central. http://developer.amd.com/zones/java/aparapi/Pages/default.aspx.

[2]

H. Chafi, A. K. Sujeeth, K. J. Brown, H. Lee, A. R. Atreya, and K. Olukotun. A domain-specific approach to heterogeneous parallelism. SIGPLAN Not., 46(8):35--46, Feb. 2011. ISSN 0362-1340. URL http://doi.acm.org/10.1145/2038037.1941561.

Digital Library

[3]

L. Dagum and R. Menon. OpenMP: an industry standard API for shared-memory programming. Computational Science & Engineering, IEEE, 5(1):46--55, 1998.

Digital Library

[4]

J. Dean, D. Grove, and C. Chambers. Optimization of object-oriented programs using static class hierarchy analysis. In ECOOP '95, pages 77--101. Springer-Verlag, 1995.

Digital Library

[5]

J. Dolby. Automatic inline allocation of objects. In PLDI '97, pages 7--17. ACM, 1997.

Digital Library

[6]

J. Dolby and A. Chien. An automatic object inlining optimization and its evaluation. In PLDI '00, pages 345--357. ACM, 2000.

Digital Library

[7]

G. Dotzler, R. Veldema, and M. Klemm. JCudaMP: OpenMP/Java on CUDA. In Proceedings of the 3rd Int'l Workshop on Multicore Software Engineering (IWMSE '10), pages 10--17. ACM, 2010.

Digital Library

[8]

M. F. Fernández. Simple and effective link-time optimization of Modula-3 programs. In PLDI '95, pages 103--115. ACM, 1995.

Digital Library

[9]

G. Ganegoda, D. Samaranayake, L. Bandara, and K. Wimalawarne. JConqurr - a multi-core programming toolkit for Java. Int'l Journal of Computer and Information Engineering, 3(4), 2009.

[10]

K. Ishizaki, M. Kawahito, T. Yasue, H. Komatsu, and T. Nakatani. A study of devirtualization techniques for a Java just-in-time compiler. In OOPSLA '00, pages 294--310. ACM, 2000.

Digital Library

[11]

jcuda.org. jcuda.org - Java bindings for CUDA. http://www.jcuda.de.

[12]

K. C. Kang, S. G. Cohen, J. A. Hess, W. E. Novak, and A. S. Peterson. Feature-oriented domain analysis (FODA) feasibility study. Technical report, DTIC Document, 1990.

[13]

Khronos OpenCL Working Group. The OpenCL specification, 2008.

[14]

C. Mellon. Software product lines --- overview. http://www.sei.cmu.edu/productlines.

[15]

N. Nystrom, D. White, and K. Das. Firepile: run-time compilation for GPUs in Scala. In Proc. of the 10th ACM int'l conf. on Generative Programming and Component Engineering (GPCE '11), pages 107--116. ACM, 2011.

Digital Library

[16]

P. C. Pratt-Szeliga, J. W. Fawcett, and R. D. Welch. Rootbeer: Seamlessly using GPUs from Java. In High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on, pages 375--380. IEEE, 2012.

Digital Library

[17]

Tokyo Institute of Technology. TSUBAME computing services. http://tsubame.gsic.titech.ac.jp.

[18]

C. Wimmer and H. Mössenböck. Automatic feedback-directed object inlining in the Java hotspot virtual machine. In Proc. of the 3rd int'l conf. on Virtual Execution Environments (VEE '07), pages 12--21. ACM, 2007.

Digital Library

[19]

Y. Yan, M. Grossman, and V. Sarkar. JCUDA: A programmer-friendly interface for accelerating Java programs with CUDA. In Euro-Par, pages 887--899, 2009.

Digital Library

[20]

W. Zaremba, Y. Lin, and V. Grover. Jabee: framework for object-oriented Java bytecode compilation and execution on graphics processor units. In Proc. of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units (GPGPU-5), pages 74--83. ACM, 2012.

Digital Library

Cited By

Chiba SZhuang YScherr MZheng YBinder WTůma P(2016)Deeply Reifying Running Code for Constructing a Domain-Specific LanguageProceedings of the 13th International Conference on Principles and Practices of Programming on the Java Platform: Virtual Machines, Languages, and Tools10.1145/2972206.2972219(1-12)Online publication date: 29-Aug-2016
https://dl.acm.org/doi/10.1145/2972206.2972219
Medeiros BSilva RSobral J(2015)Gaspar: a compositional aspect‐oriented approach for cluster applicationsConcurrency and Computation: Practice and Experience10.1002/cpe.366628:8(2353-2373)Online publication date: 7-Oct-2015
https://doi.org/10.1002/cpe.3666
Hori AYoshinaga KTokuhisa AJoti YOkada KSugimoto TYamaga MHatsui TYabashi MSugita YIshikawa YGo NDongarra JIshikawa YHori A(2014)Decoupling Architecture for All-to-all ComputationProceedings of the 21st European MPI Users' Group Meeting10.1145/2642769.2642801(169-174)Online publication date: 9-Sep-2014
https://dl.acm.org/doi/10.1145/2642769.2642801

Index Terms

A Framework for Multiplatform HPC Applications
1. Software and its engineering
  1. Software notations and tools
    1. Compilers

Recommendations

A Framework for Multiplatform HPC Applications
PMAM'14: Proceedings of Programming Models and Applications on Multicores and Manycores

This paper proposes a framework for building multi-platform applications in Java for High Performance Computing (HPC). It allows HPC developers to write their programs in Java but dynamically translate part of the programs into C programs using MPI or ...
Evaluating the Java Native Interface JNI: Leveraging Existing Native Code, Libraries and Threads to a Running Java Virtual Machine

This article aims to explore JNI features and to discover fundamental operations of the Java programming language, such as arrays, objects, classes, threads and exception handling, and to illustrate these by using various algorithms and code samples. ...
A multiplatform Java wrapper for the BioAPI framework

We present a solution for the development of multiplatform and web-oriented Java applications for biometric authentication based on the BioAPI framework. Our proposal is a single Java Native Interface wrapper that is compatible with the BioAPI ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

PMAM'14: Proceedings of Programming Models and Applications on Multicores and Manycores

February 2014

156 pages

ISBN:9781450326575

DOI:10.1145/2578948

Conference Chairs:
Pavan Balaji
Argonne National Laboratory, USA
,
Minyi Guo
Shanghai Jiao Tong, University, China
,
Zhiyi Huang
University of Otago, New Zealand

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGPLAN: ACM Special Interest Group on Programming Languages

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 February 2014

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Tutorial
Research
Refereed limited

Conference

PPoPP '14

Sponsor:

SIGPLAN

PPoPP '14: ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming

February 15 - 19, 2014

FL, Orlando, USA

Acceptance Rates

Overall Acceptance Rate 53 of 97 submissions, 55%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
178
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Reflects downloads up to 23 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Chiba SZhuang YScherr MZheng YBinder WTůma P(2016)Deeply Reifying Running Code for Constructing a Domain-Specific LanguageProceedings of the 13th International Conference on Principles and Practices of Programming on the Java Platform: Virtual Machines, Languages, and Tools10.1145/2972206.2972219(1-12)Online publication date: 29-Aug-2016
https://dl.acm.org/doi/10.1145/2972206.2972219
Medeiros BSilva RSobral J(2015)Gaspar: a compositional aspect‐oriented approach for cluster applicationsConcurrency and Computation: Practice and Experience10.1002/cpe.366628:8(2353-2373)Online publication date: 7-Oct-2015
https://doi.org/10.1002/cpe.3666
Hori AYoshinaga KTokuhisa AJoti YOkada KSugimoto TYamaga MHatsui TYabashi MSugita YIshikawa YGo NDongarra JIshikawa YHori A(2014)Decoupling Architecture for All-to-all ComputationProceedings of the 21st European MPI Users' Group Meeting10.1145/2642769.2642801(169-174)Online publication date: 9-Sep-2014
https://dl.acm.org/doi/10.1145/2642769.2642801

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents