default search action
25th HiPC 2018: Bengaluru, India
- 25th IEEE International Conference on High Performance Computing, HiPC 2018, Bengaluru, India, December 17-20, 2018. IEEE 2018, ISBN 978-1-5386-8386-6
Keynote 1
- Balaraman Ravindran:
Looking Under the Hood of Deep Neural Networks. 1
Technical Session 1: Learning
- Rajarshi Biswas, Xiaoyi Lu, Dhabaleswar K. Panda:
Accelerating TensorFlow with Adaptive RDMA-Based gRPC. 2-11 - Saurav Basu, Vaibhav Saxena, Rintu Panja, Ashish Verma:
Balancing Stragglers Against Staleness in Distributed Deep Learning. 12-21 - Grey Ballard, Koby Hayashi, Ramakrishnan Kannan:
Parallel Nonnegative CP Decomposition of Dense Tensors. 22-31 - Israt Nisa, Aravind Sukumaran-Rajam, Süreyya Emre Kurt, Changwan Hong, P. Sadayappan:
Sampled Dense Matrix Multiplication for High-Performance Machine Learning. 32-41 - Prasanna Balaprakash, Michael Salim, Thomas D. Uram, Venkat Vishwanath, Stefan M. Wild:
DeepHyper: Asynchronous Hyperparameter Search for Deep Neural Networks. 42-51
Technical Session 2: Graph Algorithms
- Jesun Sahariar Firoz, Marcin Zalewski, Thejaka Amila Kanewala, Andrew Lumsdaine:
Synchronization-Avoiding Graph Algorithms. 52-61 - Apurba Das, Seyed-Vahid Sanei-Mehri, Srikanta Tirthapura:
Shared-Memory Parallel Maximal Clique Enumeration. 62-71 - Kishore Kothapalli, Mihir Wadwekar:
Expediting Parallel Graph Connectivity Algorithms. 72-81 - Jesun Sahariar Firoz, Marcin Zalewski, Joshua Suetterlein, Andrew Lumsdaine:
Adaptive Runtime Features for Distributed Graph Algorithms. 82-91 - Hiroki Kanezashi, Toyotaro Suzumura, Dario Garcia-Gasulla, Min-hwan Oh, Satoshi Matsuoka:
Adaptive Pattern Matching with Reinforcement Learning for Dynamic Graphs. 92-101 - Priyanka Singla, Shubhankar Suman Singh, K. Gopinath, Smruti Sarangi:
Probabilistic Sequential Consistency in Social Networks. 102-111
Technical Session 3: GPUs
- Kramer Straube, Jason Lowe-Power, Christopher Nitta, Matthew K. Farrens, Venkatesh Akella:
Improving Provisioned Power Efficiency in HPC Systems with GPU-CAPP. 112-122 - Hancheng Wu, John Ravi, Michela Becchi:
Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA. 123-132 - Karthikeyan Natarajan, Nitin Chandrachoodan:
Lossless Parallel Implementation of a Turbo Decoder on GPU. 133-142 - Ammar Ahmad Awan, Ching-Hsiang Chu, Hari Subramoni, Xiaoyi Lu, Dhabaleswar K. Panda:
OC-DNN: Exploiting Advanced Unified Memory Capabilities in CUDA 9 and Volta GPUs for Out-of-Core DNN Training. 143-152 - Harichand M. V, Bharatkumar Sharma, G. Sudhakaran, V. Ashok:
Acceleration of an Adaptive Cartesian Mesh CFD Solver in the Current Generation Processor Architectures. 153-161 - Sofia Vallecorsa, Diana Moise, Federico Carminati, Gul Rukh Khattak:
Data-Parallel Training of Generative Adversarial Networks on HPC Systems for HEP Simulations. 162-171
Keynote 2
- Marc Snir:
The Future of Supercomputing. 172
Technical Session 4: Linear Algebra and Fault Tolerance
- Himeshi De Silva, John L. Gustafson, Weng-Fai Wong:
Making Strassen Matrix Multiplication Safe. 173-182 - Omer Subasi, Ramakrishna Tipireddy, Sriram Krishnamoorthy:
Quantification, Trade-off Analysis, and Optimal Checkpoint Placement for Reliability and Availability. 183-192 - Muhammed Emin Ozturk, Marissa Renardy, Yukun Li, Gagan Agrawal, Ching-Shan Chou:
A Novel Approach for Handling Soft Error in Conjugate Gradients. 193-202 - Burcu Ozcelik Mutlu, Gokcen Kestor, Joseph B. Manzano, Osman S. Unsal, Samrat Chatterjee, Sriram Krishnamoorthy:
Characterization of the Impact of Soft Errors on Iterative Methods. 203-214
Technical Session 5: Algorithms and Data Analysis
- Vasilios I. Kelefouras, Karim Djemame:
Workflow Simulation Aware and Multi-threading Effective Task Scheduling for Heterogeneous Computing. 215-224 - Xiaobo Zhu, Guangjun Wu, Hong Zhang, Shupeng Wang, Bingnan Ma:
Dynamic Count-Min Sketch for Analytical Queries Over Continuous Data Streams. 225-234 - Hao Lu, Sudip K. Seal, Jonathan D. Poplawsky:
Scalable Proximity-Based Methods for Large-Scale Analysis of Atom Probe Data. 235-244 - Sriram Srinivasan, Sara Riazi, Boyana Norris, Sajal K. Das, Sanjukta Bhowmick:
A Shared-Memory Parallel Algorithm for Updating Single-Source Shortest Paths in Large Dynamic Networks. 245-254 - Hariharan Devarajan, Anthony Kougkas, Prajwal Challa, Xian-He Sun:
Vidya: Performing Code-Block I/O Characterization for Data Access Optimization. 255-264 - Chao Li, Balaji Palanisamy:
Decentralized Privacy-Preserving Timed Execution in Blockchain-Based Smart Contract Platforms. 265-274
Keynote 3
- Srini Devadas:
Secure High-Performance Computer Architectures: Challenges and Opportunities. 275
Technical Session 6: Applications and System Tools
- Venkatesh-Prasad Ranganath, Daniel Andresen:
Why do Users Kill HPC Jobs? 276-283 - Damon Fenacci, Hans Vandierendonck, Dimitrios S. Nikolopoulos:
Code and Data Transformations to Address Garbage Collector Performance in Big Data Processing. 284-293 - Shaleen Garg, Kishore Kothapalli, Suresh Purini:
Share-a-GPU: Providing Simple and Effective Time-Sharing on GPUs. 294-303 - Gangyi Zhu, Gagan Agrawal:
A Performance Prediction Framework for Irregular Applications. 304-313 - Jia Guo, Gagan Agrawal:
Achieving Performance and Programmability for MapReduce(-Like) Frameworks. 314-323 - Vasudevan Rengasamy, Mahmut T. Kandemir, Paul Medvedev, Kamesh Madduri:
Parallel Read Partitioning for Concurrent Assembly of Metagenomic Data. 324-333
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.