Pavan Balaji
Person information
- affiliation: Argonne National Laboratory
2020 – today
- 2024
- [i4]Sitian Chen, Haobin Tan, Amelie Chi Zhou, Yusen Li, Pavan Balaji:
UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture. CoRR abs/2406.13941 (2024) - [i3]Hao Feng, Boyuan Zhang, Fanjiang Ye, Min Si, Ching-Hsiang Chu, Jiannan Tian, Chunxing Yin, Summer Deng, Yuchen Hao, Pavan Balaji, Tong Geng, Dingwen Tao:
Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression. CoRR abs/2407.04272 (2024) - 2023
- [j63]Chen Wang, Yanfei Guo, Pavan Balaji, Marc Snir:
Near-Lossless MPI Tracing and Proxy Application Autogeneration. IEEE Trans. Parallel Distributed Syst. 34(1): 123-140 (2023) - [e13]Yogesh Simmhan, Ilkay Altintas, Ana Lucia Varbanescu, Pavan Balaji, Abhinandan S. Prasad, Lorenzo Carnevale:
23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2023, Bangalore, India, May 1-4, 2023. IEEE 2023, ISBN 979-8-3503-0119-9 - [e12]Yogesh Simmhan, Ilkay Altintas, Ana Lucia Varbanescu, Pavan Balaji, Abhinandan S. Prasad, Lorenzo Carnevale:
23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2023 - Workshops, Bangalore, India, May 1-4, 2023. IEEE 2023, ISBN 979-8-3503-0208-0 - 2021
- [j62]William Gropp, Rajeev Thakur, Pavan Balaji:
Translational research in the MPICH project. J. Comput. Sci. 52: 101203 (2021) - [j61]Pavan Balaji, Jidong Zhai, Min Si:
Guest Editorial. IEEE Trans. Parallel Distributed Syst. 32(7): 1511-1512 (2021) - [j60]Rohit Zambre, Damodar Sahasrabudhe, Hui Zhou, Martin Berzins, Aparna Chandramowlishwaran, Pavan Balaji:
Logically Parallel Communication for Fast MPI+Threads Applications. IEEE Trans. Parallel Distributed Syst. 32(12): 3038-3052 (2021) - [c160]Sayan Ghosh, Yanfei Guo, Pavan Balaji, Assefaw H. Gebremedhin:
RMACXX: An Efficient High-Level C++ Interface over MPI-3 RMA. CCGRID 2021: 143-155 - [c159]Kaiming Ouyang, Min Si, Atsushi Hori, Zizhong Chen, Pavan Balaji:
Daps: A Dynamic Asynchronous Progress Stealing Model for MPI Communication. CLUSTER 2021: 516-527 - [c158]Min Si, Huansong Fu, Jeff R. Hammond, Pavan Balaji:
OpenSHMEM over MPI as a Performance Contender: Thorough Analysis and Optimizations. OpenSHMEM 2021: 39-60 - [c157]Shumpei Shiina, Shintaro Iwasaki, Kenjiro Taura, Pavan Balaji:
Lightweight preemptive user-level threads. PPoPP 2021: 374-388 - [c156]Chen Wang, Pavan Balaji, Marc Snir:
Pilgrim: scalable and (near) lossless MPI tracing. SC 2021: 52 - 2020
- [j59]Adrián Castelló, Rafael Mayo Gual, Sangmin Seo, Pavan Balaji, Enrique S. Quintana-Ortí, Antonio J. Peña:
Analysis of Threading Libraries for High Performance Computing. IEEE Trans. Computers 69(9): 1279-1292 (2020) - [j58]Shintaro Iwasaki, Abdelhalim Amer, Kenjiro Taura, Pavan Balaji:
Analyzing the Performance Trade-Off in Implementing User-Level Threads. IEEE Trans. Parallel Distributed Syst. 31(8): 1859-1877 (2020) - [j57]Tao Gao, Yanfei Guo, Boyu Zhang, Pietro Cicotti, Yutong Lu, Pavan Balaji, Michela Taufer:
Memory-Efficient and Skew-Tolerant MapReduce Over MPI for Supercomputing Systems. IEEE Trans. Parallel Distributed Syst. 31(12): 2734-2748 (2020) - [c155]Xiaomin Zhu, Yaqian Zhao, Pavan Balaji:
Probing the Underlying Implementation Mechanisms of SW26010. HPCC/DSS/SmartCity 2020: 694-699 - [c154]Rohit Zambre, Aparna Chandramowlishwaran, Pavan Balaji:
How I learned to stop worrying about user-visible endpoints and love MPI. ICS 2020: 35:1-35:13 - [c153]Noah Evans, Jan Ciesko, Stephen L. Olivier, Howard Pritchard, Shintaro Iwasaki, Ken Raffenetti, Pavan Balaji:
Implementing Flexible Threading Support in Open MPI. ExaMPI@SC 2020: 21-30 - [c152]Kaiming Ouyang, Min Si, Atsushi Hori, Zizhong Chen, Pavan Balaji:
CAB-MPI: exploring interprocess work-stealing towards balanced MPI communication. SC 2020: 36 - [i2]Rohit Zambre, Aparna Chandramowlishwaran, Pavan Balaji:
Scalable Communication Endpoints for MPI+Threads Applications. CoRR abs/2002.02509 (2020) - [i1]Rohit Zambre, Aparna Chandramowlishwaran, Pavan Balaji:
How I Learned to Stop Worrying About User-Visible Endpoints and Love MPI. CoRR abs/2005.00263 (2020)
2010 – 2019
- 2019
- [j56]Abhinav Vishnu, Pavan Balaji, Yong Chen:
Guest Editor's Introduction: P2S2: SI 2016. Parallel Comput. 82: 1-2 (2019) - [j55]Pavan Balaji, Abhinav Vishnu, Yong Chen:
Foreword to the special issue for the Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2 2017). Parallel Comput. 83: 1-2 (2019) - [j54]Pavan Balaji, Marc Casas:
Special issue on the message passing interface. Parallel Comput. 86: 14-15 (2019) - [j53]Min Si, Zhiyi Huang, Pavan Balaji:
International workshop on programming models and applications for multicores and manycores (PMAM 2018). Parallel Comput. 88 (2019) - [j52]Sarunya Pumma, Min Si, Wu-Chun Feng, Pavan Balaji:
Scalable Deep Learning via I/O Analysis and Optimization. ACM Trans. Parallel Comput. 6(2): 6:1-6:34 (2019) - [c151]Shintaro Iwasaki, Abdelhalim Amer, Kenjiro Taura, Sangmin Seo, Pavan Balaji:
BOLT: Optimizing OpenMP Parallel Regions with User-Level Threads. PACT 2019: 29-42 - [c150]Xiaomin Zhu, Yunhui Zeng, Yanjie Wei, Shengzhong Feng, Weiguo Liu, Pavan Balaji:
An Auto Code Generator for Stencil on SW26010. HPCC/SmartCity/DSS 2019: 182-190 - [c149]Seonmyeong Bak, Yanfei Guo, Pavan Balaji, Vivek Sarkar:
Optimized Execution of Parallel Loops via User-Defined Scheduling Policies. ICPP 2019: 38:1-38:10 - [c148]Abdelhalim Amer, Charles Archer, Michael Blocksome, Chongxiao Cao, Michael Chuvelev, Hajime Fujita, Maria Garzaran, Yanfei Guo, Jeff R. Hammond, Shintaro Iwasaki, Kenneth J. Raffenetti, Mikhail Shiryaev, Min Si, Kenjiro Taura, Sagar Thapaliya, Pavan Balaji:
Software combining to mitigate multithreaded MPI contention. ICS 2019: 367-379 - [c147]Joshua Hoke Davis, Tao Gao, Sunita Chandrasekaran, Heike Jagode, Anthony Danalis, Jack J. Dongarra, Pavan Balaji, Michela Taufer:
Characterization of Power Usage and Performance in Data-Intensive Applications Using MapReduce over MPI. PARCO 2019: 287-298 - [e11]Zheng Xiao, Laurence T. Yang, Pavan Balaji, Tao Li, Keqin Li, Albert Y. Zomaya:
21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2019, Zhangjiajie, China, August 10-12, 2019. IEEE 2019, ISBN 978-1-7281-2058-4 - [e10]Michela Taufer, Pavan Balaji, Antonio J. Peña:
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2019, Denver, Colorado, USA, November 17-19, 2019. ACM 2019, ISBN 978-1-4503-6229-0 - 2018
- [j51]Adrián Castelló, Rafael Mayo, Kevin Sala, Vicenç Beltran, Pavan Balaji, Antonio J. Peña:
On the adequacy of lightweight thread approaches for high-level parallel programming models. Future Gener. Comput. Syst. 84: 22-31 (2018) - [j50]Pavan Balaji, Kai-Cheung Leung:
Introduction. Int. J. High Perform. Comput. Appl. 32(6) (2018) - [j49]Adrián Castelló, Antonio J. Peña, Rafael Mayo, Judit Planas, Enrique S. Quintana-Ortí, Pavan Balaji:
Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models. J. Supercomput. 74(11): 5628-5642 (2018) - [j48]Abdelhalim Amer, Huiwei Lu, Pavan Balaji, Milind Chabbi, Yanjie Wei, Jeff R. Hammond, Satoshi Matsuoka:
Lock Contention Management in Multithreaded MPI. ACM Trans. Parallel Comput. 5(3): 12:1-12:21 (2018) - [j47]Sangmin Seo, Abdelhalim Amer, Pavan Balaji, Cyril Bordage, George Bosilca, Alex Brooks, Philip H. Carns, Adrián Castelló, Damien Genet, Thomas Hérault, Shintaro Iwasaki, Prateek Jindal, Laxmikant V. Kalé, Sriram Krishnamoorthy, Jonathan Lifflander, Huiwei Lu, Esteban Meneses, Marc Snir, Yanhua Sun, Kenjiro Taura, Peter H. Beckman:
Argobots: A Lightweight Low-Level Threading and Tasking Framework. IEEE Trans. Parallel Distributed Syst. 29(3): 512-526 (2018) - [j46]Min Si, Antonio J. Peña, Jeff R. Hammond, Pavan Balaji, Masamichi Takagi, Yutaka Ishikawa:
Dynamic Adaptable Asynchronous Progress Model for MPI RMA Multiphase Applications. IEEE Trans. Parallel Distributed Syst. 29(9): 1975-1989 (2018) - [c146]Jianqiu Ge, Ning Guo, Jintao Meng, Bingqiang Wang, Pavan Balaji, Shengzhong Feng, Jiaxiu Zhou, Yanjie Wei:
K-mer Counting for Genomic Big Data. BigData Congress 2018: 345-351 - [c145]Atsushi Hori, Min Si, Balazs Gerofi, Masamichi Takagi, Jai Dayal, Pavan Balaji, Yutaka Ishikawa:
Process-in-process: techniques for practical address-space sharing. HPDC 2018: 131-143 - [c144]Tao Gao, Yanfei Guo, Boyu Zhang, Pietro Cicotti, Yutong Lu, Pavan Balaji, Michela Taufer:
On the Power of Combiner Optimizations in MapReduce Over MPI Workflows. ICPADS 2018: 441-448 - [c143]Rohit Zambre, Aparna Chandramowlishwaran, Pavan Balaji:
Scalable Communication Endpoints for MPI+Threads Applications. ICPADS 2018: 803-812 - [c142]Shintaro Iwasaki, Abdelhalim Amer, Kenjiro Taura, Pavan Balaji:
Lessons learned from analyzing dynamic promotion for user-level threading. SC 2018: 23:1-23:12 - [c141]Sudheer Chunduri, Scott Parker, Pavan Balaji, Kevin Harms, Kalyan Kumaran:
Characterization of MPI usage on a production supercomputer. SC 2018: 30:1-30:15 - [e9]Quan Chen, Zhiyi Huang, Pavan Balaji:
Proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM@PPoPP 2018, February 25, 2018, Vienna, Austria. ACM 2018 - 2017
- [j45]Pavan Balaji, Kai-Cheung Leung:
Foreword to the Special Issue of the workshop on the seventh international workshop on programming models and applications for multicores and manycores (PMAM 2016). Concurr. Comput. Pract. Exp. 29(15) (2017) - [j44]Pavan Balaji, Zhiyi Huang:
Special issue on programming models and applications for multicores and manycores. Int. J. High Perform. Comput. Appl. 31(5): 359-360 (2017) - [j43]Andrew A. Chien, Pavan Balaji, Nan Dun, Aiman Fang, Hajime Fujita, Kamil Iskra, Zachary A. Rubenstein, Ziming Zheng, Jeff R. Hammond, Ignacio Laguna, D. Richards, Anshu Dubey, Brian van Straalen, Mark Hoemmen, Michael A. Heroux, Keita Teranishi, Andrew R. Siegel:
Exploring versioned distributed arrays for resilience in scientific applications. Int. J. High Perform. Comput. Appl. 31(6): 564-590 (2017) - [j42]Boyu Zhang, Trilce Estrada, Pietro Cicotti, Pavan Balaji, Michela Taufer:
Enabling scalable and accurate clustering of distributed ligand geometries on supercomputers. Parallel Comput. 63: 38-60 (2017) - [c140]Hoang-Vu Dang, Sangmin Seo, Abdelhalim Amer, Pavan Balaji:
Advanced Thread Synchronization for Multithreaded MPI Implementations. CCGrid 2017: 314-324 - [c139]Nikela Papadopoulou, Lena Oden, Pavan Balaji:
A Performance Study of UCX over InfiniBand. CCGrid 2017: 345-354 - [c138]Jintao Meng, Ning Guo, Jianqiu Ge, Yanjie Wei, Pavan Balaji, Bingqiang Wang:
Scalable Assembly for Massive Genomic Graphs. CCGrid 2017: 665-670 - [c137]Xiaohui Duan, Kai Xu, Yuandong Chan, Christian Hundt, Bertil Schmidt, Pavan Balaji, Weiguo Liu:
S-Aligner: Ultrascalable Read Mapping on Sunway Taihu Light. CLUSTER 2017: 36-46 - [c136]Adrián Castelló, Sangmin Seo, Rafael Mayo, Pavan Balaji, Enrique S. Quintana-Ortí, Antonio J. Peña:
GLT: A Unified API for Lightweight Thread Libraries. Euro-Par 2017: 470-481 - [c135]Seyed Hessam Mirsadeghi, Jesper Larsson Träff, Pavan Balaji, Ahmad Afsahi:
Exploiting Common Neighborhoods to Optimize MPI Neighborhood Collectives. HiPC 2017: 348-357 - [c134]Min Si, Pavan Balaji:
Process-Based Asynchronous Progress Model for MPI Point-to-Point Communication. HPCC/SmartCity/DSS 2017: 206-214 - [c133]Sarunya Pumma, Min Si, Wu-chun Feng, Pavan Balaji:
Towards Scalable Deep Learning via I/O Analysis and Optimization. HPCC/SmartCity/DSS 2017: 223-230 - [c132]Tao Gao, Yanfei Guo, Yanjie Wei, Bingqiang Wang, Yutong Lu, Pietro Cicotti, Pavan Balaji, Michela Taufer:
Bloomfish: A Highly Scalable Distributed K-mer Counting Framework. ICPADS 2017: 170-179 - [c131]Lena Oden, Pavan Balaji:
Hexe: A Toolkit for Heterogeneous Memory Management. ICPADS 2017: 656-663 - [c130]Robert Latham, Leonardo Bautista-Gomez, Pavan Balaji:
Portable Topology-Aware MPI-I/O. ICPADS 2017: 710-719 - [c129]Sarunya Pumma, Min Si, Wu-chun Feng, Pavan Balaji:
Parallel I/O Optimizations for Scalable Deep Learning. ICPADS 2017: 720-729 - [c128]Adrián Castelló, Sangmin Seo, Rafael Mayo, Pavan Balaji, Enrique S. Quintana-Ortí, Antonio J. Peña:
GLTO: On the Adequacy of Lightweight Thread Approaches for OpenMP Implementations. ICPP 2017: 60-69 - [c127]Yanfei Guo, Charles J. Archer, Michael Blocksome, Scott Parker, Wesley Bland, Ken Raffenetti, Pavan Balaji:
Memory Compression Techniques for Network Address Management in MPI. IPDPS 2017: 1008-1017 - [c126]Tao Gao, Yanfei Guo, Boyu Zhang, Pietro Cicotti, Yutong Lu, Pavan Balaji, Michela Taufer:
Mimir: Memory-Efficient and Scalable MapReduce for Large Supercomputing Systems. IPDPS 2017: 1098-1108 - [c125]Pavan Balaji:
PDSEC Keynote. IPDPS Workshops 2017: 1117 - [c124]Ken Raffenetti, Abdelhalim Amer, Lena Oden, Charles Archer, Wesley Bland, Hajime Fujita, Yanfei Guo, Tomislav Janjusic, Dmitry Durnov, Michael Blocksome, Min Si, Sangmin Seo, Akhil Langer, Gengbin Zheng, Masamichi Takagi, Paul K. Coffman, Jithin Jose, Sayantan Sur, Alexander Sannikov, Sergey Oblomov, Michael Chuvelev, Masayuki Hatanaka, Xin Zhao, Paul F. Fischer, Thilina Rathnayake, Matthew Otten, Misun Min, Pavan Balaji:
Why is MPI so slow?: analyzing the fundamental limits in implementing MPI-3.1. SC 2017: 62 - [e8]Antonio J. Peña, Pavan Balaji, William Gropp, Rajeev Thakur:
Proceedings of the 24th European MPI Users' Group Meeting, EuroMPI/USA 2017, Chicago, IL, USA, September 25-28, 2017. ACM 2017, ISBN 978-1-4503-4849-2 - [e7]Julian M. Kunkel, Rio Yokota, Pavan Balaji, David E. Keyes:
High Performance Computing - 32nd International Conference, ISC High Performance 2017, Frankfurt, Germany, June 18-22, 2017, Proceedings. Lecture Notes in Computer Science 10266, Springer 2017, ISBN 978-3-319-58666-3 - 2016
- [j41]Abdul Hameed, Alireza Khoshkbarforoushha, Rajiv Ranjan, Prem Prakash Jayaraman, Joanna Kolodziej, Pavan Balaji, Sherali Zeadally, Qutaibah Marwan Malluhi, Nikos Tziritas, Abhinav Vishnu, Samee U. Khan, Albert Y. Zomaya:
A survey and taxonomy on energy efficient resource allocation techniques for cloud computing systems. Computing 98(7): 751-774 (2016) - [j40]Pavan Balaji, Zhiyi Huang:
Programming models and applications for multicores and manycores. Concurr. Comput. Pract. Exp. 28(2): 453-454 (2016) - [j39]Humayun Arafat, James Dinan, Sriram Krishnamoorthy, Pavan Balaji, P. Sadayappan:
Work stealing for GPU-accelerated parallel programs in a global address space framework. Concurr. Comput. Pract. Exp. 28(13): 3637-3654 (2016) - [j38]James Dinan, Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Rajeev Thakur:
An implementation and evaluation of the MPI 3.0 one-sided communication interface. Concurr. Comput. Pract. Exp. 28(17): 4385-4404 (2016) - [j37]Saif Ur Rehman Malik, Samee U. Khan, Sam J. Ewen, Nikos Tziritas, Joanna Kolodziej, Albert Y. Zomaya, Sajjad Ahmad Madani, Nasro Min-Allah, Lizhe Wang, Cheng-Zhong Xu, Qutaibah M. Malluhi, Johnatan E. Pecero, Pavan Balaji, Abhinav Vishnu, Rajiv Ranjan, Sherali Zeadally, Hongxiang Li:
Performance analysis of data intensive cloud systems based on data management and replication: a survey. Distributed Parallel Databases 34(2): 179-215 (2016) - [j36]Pavan Balaji, Abhinav Vishnu, Yong Chen:
Special Issue on Parallel Programming Models and Systems Software for High-End Computing. Parallel Comput. 51: 1-2 (2016) - [j35]Antonio J. Peña, Pavan Balaji:
A data-oriented profiler to assist in data partitioning and distribution for heterogeneous memory in HPC. Parallel Comput. 51: 46-55 (2016) - [j34]Michela Taufer, Pavan Balaji, Satoshi Matsuoka:
Special Issue on Cluster Computing. Parallel Comput. 58: 25-26 (2016) - [j33]Ashwin M. Aji, Antonio J. Peña, Pavan Balaji, Wu-chun Feng:
MultiCL: Enabling automatic scheduling for task-parallel workloads in OpenCL. Parallel Comput. 58: 37-55 (2016) - [j32]Junaid Shuja, Kashif Bilal, Sajjad Ahmad Madani, Mazliza Othman, Rajiv Ranjan, Pavan Balaji, Samee Ullah Khan:
Survey of Techniques and Architectures for Designing Energy-Efficient Data Centers. IEEE Syst. J. 10(2): 507-519 (2016) - [j31]Ashwin M. Aji, Lokendra S. Panwar, Feng Ji, Karthik Murthy, Milind Chabbi, Pavan Balaji, Keith R. Bisset, James Dinan, Wu-chun Feng, John M. Mellor-Crummey, Xiaosong Ma, Rajeev Thakur:
MPI-ACC: Accelerator-Aware MPI for Scientific Applications. IEEE Trans. Parallel Distributed Syst. 27(5): 1401-1414 (2016) - [c123]Jichi Guo, Qing Yi, Jiayuan Meng, Junchao Zhang, Pavan Balaji:
Compiler-Assisted Overlapping of Communication and Computation in MPI Applications. CLUSTER 2016: 60-69 - [c122]Adrián Castelló, Antonio J. Peña, Sangmin Seo, Rafael Mayo, Pavan Balaji, Enrique S. Quintana-Ortí:
A Review of Lightweight Thread Approaches for High Performance Computing. CLUSTER 2016: 471-480 - [c121]Sayan Ghosh, Jeff R. Hammond, Antonio J. Peña, Pavan Balaji, Assefaw Hadish Gebremedhin, Barbara M. Chapman:
One-Sided Interface for Matrix Operations Using MPI-3 RMA: A Case Study with Elemental. ICPP 2016: 185-194 - [c120]Jintao Meng, Sangmin Seo, Pavan Balaji, Yanjie Wei, Bingqiang Wang, Shengzhong Feng:
SWAP-Assembler 2: Optimization of De Novo Genome Assembler at Extreme Scale. ICPP 2016: 195-204 - [c119]Xin Zhao, Pavan Balaji, William Gropp:
Scalability Challenges in Current MPI One-Sided Implementations. ISPDC 2016: 38-47 - [c118]Abdelhalim Amer, Satoshi Matsuoka, Miquel Pericàs, Naoya Maruyama, Kenjiro Taura, Rio Yokota, Pavan Balaji:
Scaling FMM with Data-Driven OpenMP Tasks on Multicore Architectures. IWOMP 2016: 156-170 - [e6]Pavan Balaji, Kai-Cheung Leung:
Proceedings of the 7th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM@PPoPP 2016, Barcelona, Spain, March 12-16, 2016. ACM 2016, ISBN 978-1-4503-4196-7 - [e5]Julian M. Kunkel, Pavan Balaji, Jack J. Dongarra:
High Performance Computing - 31st International Conference, ISC High Performance 2016, Frankfurt, Germany, June 19-23, 2016, Proceedings. Lecture Notes in Computer Science 9697, Springer 2016, ISBN 978-3-319-41320-4 - 2015
- [j30]Pavan Balaji, Lisong Xu, Changjun Jiang, Xiaobo Zhou:
Introduction Special Section of ICCCN 2014 Conference. Comput. Commun. 69: 38-39 (2015) - [j29]Ryan E. Grant, Mohammad J. Rashti, Pavan Balaji, Ahmad Afsahi:
Scalable connectionless RDMA over unreliable datagrams. Parallel Comput. 48: 15-39 (2015) - [j28]Torsten Hoefler, James Dinan, Rajeev Thakur, Brian Barrett, Pavan Balaji, William Gropp, Keith D. Underwood:
Remote Memory Access Programming in MPI-3. ACM Trans. Parallel Comput. 2(2): 9:1-9:26 (2015) - [c117]Min Si, Pavan Balaji, Yutaka Ishikawa:
Techniques for Enabling Highly Efficient Message Passing on Many-Core Architectures. CCGRID 2015: 697-700 - [c116]Xin Zhao, Pavan Balaji, William Gropp:
Runtime Support for Irregular Computation in MPI-Based Applications. CCGRID 2015: 701-704 - [c115]Jintao Meng, Yanjie Wei, Sangmin Seo, Pavan Balaji:
SWAP-Assembler 2: Scalable Genome Assembler towards Millions of Cores - Practice and Experience. CCGRID 2015: 769-772 - [c114]Min Si, Antonio J. Peña, Jeff R. Hammond, Pavan Balaji, Yutaka Ishikawa:
Scaling NWChem with Efficient and Portable Asynchronous Communication in MPI RMA. CCGRID 2015: 811-816 - [c113]Boyu Zhang, Trilce Estrada, Pietro Cicotti, Pavan Balaji, Michela Taufer:
Accurate Scoring of Drug Conformations at the Extreme Scale. CCGRID 2015: 817-822 - [c112]Abdelhalim Amer, Huiwei Lu, Pavan Balaji, Satoshi Matsuoka:
Characterizing MPI and Hybrid MPI+Threads Applications at Scale: Case Study with BFS. CCGRID 2015: 1075-1083 - [c111]Sangmin Seo, Robert Latham, Junchao Zhang, Pavan Balaji:
Implementation and Evaluation of MPI Nonblocking Collective I/O. CCGRID 2015: 1084-1091 - [c110]Xiaomin Zhu, Junchao Zhang, Kazutomo Yoshii, Shigang Li, Yunquan Zhang, Pavan Balaji:
Analyzing MPI-3.0 Process-Level Shared Memory: A Case Study with Stencil Computations. CCGRID 2015: 1099-1106 - [c109]Wesley Bland, Huiwei Lu, Sangmin Seo, Pavan Balaji:
Lessons Learned Implementing User-Level Failure Mitigation in MPICH. CCGRID 2015: 1123-1126 - [c108]Antonio J. Peña, Pavan Balaji:
Understanding Data Access Patterns Using Object-Differentiated Memory Profiling. CCGRID 2015: 1143-1146 - [c107]Ken Raffenetti, Antonio J. Peña, Pavan Balaji:
Toward Implementing Robust Support for Portals 4 Networks in MPICH. CCGRID 2015: 1173-1176 - [c106]Ashwin Mandayam Aji, Antonio J. Peña, Pavan Balaji, Wu-chun Feng:
Automatic Command Queue Scheduling for Task-Parallel Workloads in OpenCL. CLUSTER 2015: 42-51 - [c105]Adrián Castelló, Antonio J. Peña, Rafael Mayo, Pavan Balaji, Enrique S. Quintana-Ortí:
Exploring the Suitability of Remote GPGPU Virtualization for the OpenACC Programming Model Using rCUDA. CLUSTER 2015: 92-95 - [c104]Hajime Fujita, Kamil Iskra, Pavan Balaji, Andrew A. Chien:
Empirical Comparison of Three Versioning Architectures. CLUSTER 2015: 456-459 - [c103]Nan Dun, Hajime Fujita, Aiman Fang, Yan Liu, Andrew A. Chien, Pavan Balaji, Kamil Iskra, Wesley Bland, Andrew R. Siegel:
Flexible Error Recovery Using Versions in Global View Resilience. CLUSTER 2015: 512-513 - [c102]Huiwei Lu, Sangmin Seo, Pavan Balaji:
MPI+ULT: Overlapping Communication and Computation with User-Level Threads. HPCC/CSS/ICESS 2015: 444-454 - [c101]Andrew A. Chien, Pavan Balaji, Peter H. Beckman, Nan Dun, Aiman Fang, Hajime Fujita, Kamil Iskra, Zachary A. Rubenstein, Ziming Zheng, Rob Schreiber, Jeff R. Hammond, James Dinan, Ignacio Laguna, D. Richards, Anshu Dubey, Brian van Straalen, Mark Hoemmen, Michael A. Heroux, Keita Teranishi, Andrew R. Siegel:
Versioned Distributed Arrays for Resilience in Scientific Applications: Global View Resilience. ICCS 2015: 29-38 - [c100]Hajime Fujita, Kamil Iskra, Pavan Balaji, Andrew A. Chien:
Versioning Architectures for Local and Global Memory. ICPADS 2015: 515-524 - [c99]Ramesh Hariharan, Ananth Kalyanaraman, Michela Taufer, Trilce Estrada, Pietro Cicotti, Pavan Balaji:
HiCOMB 2015 Keynote and Invited Talks. IPDPS Workshops 2015: 331 - [c98]James Dinan, Wenguang Chen, Xiaosong Ma, Pavan Balaji, Satoshi Matsuoka, Jiayuan Meng, Yunquan Zhang:
AsHES Introduction and Committees. IPDPS Workshops 2015: 591-592 - [c97]Min Si, Antonio J. Peña, Jeff R. Hammond, Pavan Balaji, Masamichi Takagi, Yutaka Ishikawa:
Casper: An Asynchronous Progress Model for MPI RMA on Many-Core Architectures. IPDPS 2015: 665-676 - [c96]Abdelhalim Amer, Huiwei Lu, Yanjie Wei, Pavan Balaji, Satoshi Matsuoka:
MPI+Threads: runtime contention and remedies. PPoPP 2015: 239-248 - [c95]Karthikeyan Vaidyanathan, Dhiraj D. Kalamkar, Kiran Pamnany, Jeff R. Hammond, Pavan Balaji, Dipankar Das, Jongsoo Park, Bálint Joó:
Improving concurrency and asynchrony in multithreaded MPI applications using software offloading. SC 2015: 30:1-30:12 - [c94]Yanfei Guo, Wesley Bland, Pavan Balaji, Xiaobo Zhou:
Fault tolerant MapReduce-MPI for HPC clusters. SC 2015: 34:1-34:12 - [c93]Antonio J. Peña, Wesley Bland, Pavan Balaji:
VOCL-FT: introducing techniques for efficient soft error coprocessor recovery. SC 2015: 71:1-71:12 - [e4]Pavan Balaji, Minyi Guo, Zhiyi Huang:
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM@PPoPP 2015, San Francisco, CA, USA, February 7-8, 2015. ACM 2015, ISBN 978-1-4503-3404-4 - [r1]Ryan E. Grant, Mohammad J. Rashti, Pavan Balaji, Ahmad Afsahi:
Scalable Network Communication Using Unreliable RDMA. Handbook on Data Centers 2015: 393-424 - 2014
- [j27]Jintao Meng, Bingqiang Wang, Yanjie Wei, Shengzhong Feng, Pavan Balaji:
SWAP-Assembler: scalable and efficient genome assembly towards thousands of cores. BMC Bioinform. 15(S-9): S2 (2014) - [j26]Marc Snir, Robert W. Wisniewski, Jacob A. Abraham, Sarita V. Adve, Saurabh Bagchi, Pavan Balaji, James F. Belak, Pradip Bose, Franck Cappello, Bill Carlson, Andrew A. Chien, Paul Coteus, Nathan DeBardeleben, Pedro C. Diniz, Christian Engelmann, Mattan Erez, Saverio Fazzari, Al Geist, Rinku Gupta, Fred Johnson, Sriram Krishnamoorthy, Sven Leyffer, Dean Liberty, Subhasish Mitra, Todd S. Munson, Rob Schreiber, Jon Stearley, Eric Van Hensbergen:
Addressing failures in exascale computing. Int. J. High Perform. Comput. Appl. 28(2): 129-173 (2014) - [j25]James Dinan, Ryan E. Grant, Pavan Balaji, David Goodell, Douglas Miller, Marc Snir, Rajeev Thakur:
Enabling communication concurrency through flexible MPI endpoints. Int. J. High Perform. Comput. Appl. 28(4): 390-405 (2014) - [j24]Pavan Balaji, Zhiyi Huang:
Special issue on programming models and applications for multicores and manycores - Guest Editors' Introduction. Parallel Comput. 40(2): 33-34 (2014) - [j23]John Jenkins, James Dinan, Pavan Balaji, Tom Peterka, Nagiza F. Samatova, Rajeev Thakur:
Processing MPI Derived Datatypes on Noncontiguous GPU-Resident Data. IEEE Trans. Parallel Distributed Syst. 25(10): 2627-2637 (2014) - [c92]Antonio J. Peña, Pavan Balaji:
Toward the efficient use of multiple explicitly managed memory subsystems. CLUSTER 2014: 123-131 - [c91]David Ozog, Allen D. Malony, Jeff R. Hammond, Pavan Balaji:
WorkQ: A many-core producer/consumer execution model applied to PGAS computations. ICPADS 2014: 632-639 - [c90]Antonio J. Peña, Pavan Balaji:
A Framework for Tracking Memory Accesses in Scientific Applications. ICPP Workshops 2014: 235-244 - [c89]Min Si, Antonio J. Peña, Pavan Balaji, Masamichi Takagi, Yutaka Ishikawa:
MT-MPI: multithreaded MPI for many-core environments. ICS 2014: 125-134 - [c88]Chaoran Yang, Wesley Bland, John M. Mellor-Crummey, Pavan Balaji:
Portable, MPI-interoperable coarray fortran. PPoPP 2014: 81-92 - [c87]Junchao Zhang, Bill Long, Kenneth Raffenetti, Pavan Balaji:
Implementing the MPI-3.0 Fortran 2008 Binding. EuroMPI/ASIA 2014: 1 - [c86]Wesley Bland, Kenneth Raffenetti, Pavan Balaji:
Simplifying the recovery model of user-level failure mitigation. ExaMPI@SC 2014: 20-25 - [c85]Judicael A. Zounmevo, Xin Zhao, Pavan Balaji, William Gropp, Ahmad Afsahi:
Nonblocking Epochs in MPI One-Sided Communication. SC 2014: 475-486 - [c84]Zhezhe Chen, James Dinan, Zhen Tang, Pavan Balaji, Hua Zhong, Jun Wei, Tao Huang, Feng Qin:
MC-Checker: Detecting Memory Consistency Errors in MPI One-Sided Applications. SC 2014: 499-510 - [e3]Pavan Balaji, Minyi Guo, Zhiyi Huang:
Proceedings of the 2014 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2014, Orlando, Florida, USA, February 15, 2014. ACM 2014, ISBN 978-1-4503-2657-5 - 2013
- [j22]Giorgio Valentini, Walter Lassonde, Samee Ullah Khan, Nasro Min-Allah, Sajjad Ahmad Madani, Juan Li, Limin Zhang, Lizhe Wang, Nasir Ghani, Joanna Kolodziej, Hongxiang Li, Albert Y. Zomaya, Cheng-Zhong Xu, Pavan Balaji, Abhinav Vishnu, Frédéric Pinel, Johnatan E. Pecero, Dzmitry Kliazovich, Pascal Bouvry:
An overview of energy efficiency techniques in cluster computing systems. Clust. Comput. 16(1): 3-15 (2013) - [j21]Torsten Hoefler, James Dinan, Darius Buntinas, Pavan Balaji, Brian Barrett, Ron Brightwell, William Gropp, Vivek Kale, Rajeev Thakur:
MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory. Computing 95(12): 1121-1136 (2013) - [j20]Pavan Balaji, Rajkumar Buyya:
Guest editors' introduction: Special issue on Cluster, Grid, and Cloud Computing. Future Gener. Comput. Syst. 29(8): 2220-2221 (2013) - [j19]Pavan Balaji, Satoshi Matsuoka:
Guest Editors' Introduction: Special Issue on Applications for the Heterogeneous Computing Era. Int. J. High Perform. Comput. Appl. 27(2): 87-88 (2013) - [j18]Hameed Hussain, Saif Ur Rehman Malik, Abdul Hameed, Samee Ullah Khan, Gage Bickler, Nasro Min-Allah, Muhammad Bilal Qureshi, Limin Zhang, Yongji Wang, Nasir Ghani, Joanna Kolodziej, Albert Y. Zomaya, Cheng-Zhong Xu, Pavan Balaji, Abhinav Vishnu, Frédéric Pinel, Johnatan E. Pecero, Dzmitry Kliazovich, Pascal Bouvry, Hongxiang Li, Lizhe Wang, Dan Chen, Ammar Rayes:
A survey on resource allocation in high performance distributed computing systems. Parallel Comput. 39(11): 709-736 (2013) - [j17]Yong Chen, Pavan Balaji, Abhinav Vishnu:
Special issue on programming models, systems software, and tools for High-End Computing. Parallel Comput. 39(12): 751-752 (2013) - [j16]Abhinav Vishnu, Pavan Balaji, Yong Chen:
Guest Editors' introduction. J. Supercomput. 63(2): 323-325 (2013) - [j15]Abhinav Vishnu, Shuaiwen Song, Andres Marquez, Kevin J. Barker, Darren J. Kerbyson, Kirk W. Cameron, Pavan Balaji:
Designing energy efficient communication runtime systems: a view from PGAS models. J. Supercomput. 63(3): 691-709 (2013) - [c83]Xin Zhao, Darius Buntinas, Judicael A. Zounmevo, James Dinan, David Goodell, Pavan Balaji, Rajeev Thakur, Ahmad Afsahi, William Gropp:
Toward Asynchronous and MPI-Interoperable Active Messages. CCGRID 2013: 87-94 - [c82]Jing Zhang, Heshan Lin, Pavan Balaji, Wu-chun Feng:
Optimizing Burrows-Wheeler Transform-Based Sequence Alignment on Multicore Architectures. CCGRID 2013: 377-384 - [c81]Xin Zhao, Pavan Balaji, William Gropp, Rajeev Thakur:
Optimization Strategies for MPI-Interoperable Active Messages. DASC 2013: 508-515 - [c80]Naoya Maruyama, Leif Kobbelt, Pavan Balaji, Nikola Puzovic, Samuel Thibault, Kun Zhou:
Topic 15: GPU and Accelerator Computing - (Introduction). Euro-Par 2013: 800 - [c79]Pavan Balaji, Dries Kimpe:
On the Reproducibility of MPI Reduction Operations. HPCC/EUC 2013: 407-414 - [c78]Ashwin M. Aji, Lokendra S. Panwar, Feng Ji, Milind Chabbi, Karthik Murthy, Pavan Balaji, Keith R. Bisset, James Dinan, Wu-chun Feng, John M. Mellor-Crummey, Xiaosong Ma, Rajeev Thakur:
On the efficacy of GPU-integrated MPI for scientific applications. HPDC 2013: 191-202 - [c77]Palden Lama, Yan Li, Ashwin M. Aji, Pavan Balaji, James Dinan, Shucai Xiao, Yunquan Zhang, Wu-chun Feng, Rajeev Thakur, Xiaobo Zhou:
pVOCL: Power-Aware Dynamic Placement and Migration in Virtualized GPU Environments. ICDCS 2013: 145-154 - [c76]Xin Zhao, Pavan Balaji, William Gropp, Rajeev Thakur:
MPI-Interoperable Generalized Active Messages. ICPADS 2013: 200-207 - [c75]Lokendra S. Panwar, Ashwin M. Aji, Jiayuan Meng, Pavan Balaji, Wu-chun Feng:
Online Performance Projection for Clusters with Heterogeneous GPUs. ICPADS 2013: 283-290 - [c74]David Ozog, Jeff R. Hammond, James Dinan, Pavan Balaji, Sameer Shende, Allen D. Malony:
Inspector-Executor Load Balancing Algorithms for Block-Sparse Tensor Contractions. ICPP 2013: 30-39 - [c73]Md. Ziaul Haque, Qing Yi, James Dinan, Pavan Balaji:
Enhancing Performance Portability of MPI Applications through Annotation-Based Transformations. ICPP 2013: 631-640 - [c72]David Ozog, Sameer Shende, Allen D. Malony, Jeff R. Hammond, James Dinan, Pavan Balaji:
Inspector/executor load balancing algorithms for block-sparse tensor contractions. ICS 2013: 483-484 - [c71]Ashwin M. Aji, Pavan Balaji, James Dinan, Wu-chun Feng, Rajeev Thakur:
Synchronization and Ordering Semantics in Hybrid MPI+GPU Programming. IPDPS Workshops 2013: 1020-1029 - [c70]James Dinan, Pavan Balaji, David Goodell, Douglas Miller, Marc Snir, Rajeev Thakur:
Enabling MPI interoperability through flexible communication endpoints. EuroMPI 2013: 13-18 - [c69]Antonio J. Peña, Ralf G. Correa Carvalho, James Dinan, Pavan Balaji, Rajeev Thakur, William Gropp:
Analysis of topology-dependent MPI performance on Gemini networks. EuroMPI 2013: 61-66 - [c68]Jue Hong, Pavan Balaji, Gaojin Wen, Bibo Tu, Junming Yan, Cheng-Zhong Xu, Shengzhong Feng:
Container-Based Job Management for Fair Resource Sharing. ISC 2013: 290-301 - [e2]Pavan Balaji, Minyi Guo, Zhiyi Huang:
Proceedings of the 2013 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2013, Shenzhen, China, February 23, 2013. ACM 2013, ISBN 978-1-4503-1908-9 [contents] - 2012
- [j14]Pavan Balaji, Jiayuan Meng:
Applications for the Heterogeneous Computing Era. Int. J. High Perform. Comput. Appl. 26(2): 146-147 (2012) - [c67]Shucai Xiao, Pavan Balaji, James Dinan, Qian Zhu, Rajeev Thakur, Susan Coghlan, Heshan Lin, Gaojin Wen, Jue Hong, Wu-chun Feng:
Transparent Accelerator Migration in a Virtualized GPU Environment. CCGRID 2012: 124-131 - [c66]John Jenkins, James Dinan, Pavan Balaji, Nagiza F. Samatova, Rajeev Thakur:
Enabling Fast, Noncontiguous GPU Data Movement in Hybrid MPI+GPU Environments. CLUSTER 2012: 468-476 - [c65]Feng Ji, Ashwin M. Aji, James Dinan, Darius Buntinas, Pavan Balaji, Rajeev Thakur, Wu-chun Feng, Xiaosong Ma:
DMA-Assisted, Intranode Communication in GPU Accelerated Systems. HPCC-ICESS 2012: 461-468 - [c64]Ashwin M. Aji, James Dinan, Darius Buntinas, Pavan Balaji, Wu-chun Feng, Keith R. Bisset, Rajeev Thakur:
MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-based Systems. HPCC-ICESS 2012: 647-654 - [c63]James Dinan, Pavan Balaji, Jeff R. Hammond, Sriram Krishnamoorthy, Vinod Tipparaju:
Supporting the Global Arrays PGAS Model Using MPI One-Sided Communication. IPDPS 2012: 739-750 - [c62]Pavan Balaji:
ASHES Introduction. IPDPS Workshops 2012: 1827 - [c61]Feng Ji, Ashwin M. Aji, James Dinan, Darius Buntinas, Pavan Balaji, Wu-chun Feng, Xiaosong Ma:
Efficient Intranode Communication in GPU-Accelerated Systems. IPDPS Workshops 2012: 1838-1847 - [c60]James Dinan, David Goodell, William Gropp, Rajeev Thakur, Pavan Balaji:
Efficient Multithreaded Context ID Allocation in MPI. EuroMPI 2012: 57-66 - [c59]Torsten Hoefler, James Dinan, Darius Buntinas, Pavan Balaji, Brian W. Barrett, Ron Brightwell, William Gropp, Vivek Kale, Rajeev Thakur:
Leveraging MPI's One-Sided Communication Interface for Shared-Memory Programming. EuroMPI 2012: 132-141 - 2011
- [j13]Pavan Balaji, Rinku Gupta, Abhinav Vishnu, Peter H. Beckman:
Mapping communication layouts to network hardware characteristics on massive-scale Blue Gene systems. Comput. Sci. Res. Dev. 26(3-4): 247-256 (2011) - [j12]Pavan Balaji, Abhinav Vishnu:
Special Issue on Programming Models and Systems Software Support for High-End Computing Applications. Int. J. High Perform. Comput. Appl. 25(2): 135-136 (2011) - [j11]Pavan Balaji, Abhinav Vishnu:
Special Issue on Programming Models, Software and Tools for High-End Computing. Int. J. High Perform. Comput. Appl. 25(4): 353-354 (2011) - [j10]Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Torsten Hoefler, Sameer Kumar, Ewing L. Lusk, Rajeev Thakur, Jesper Larsson Träff:
MPI on Millions of Cores. Parallel Process. Lett. 21(1): 45-60 (2011) - [c58]Gaojin Wen, Jue Hong, Cheng-Zhong Xu, Pavan Balaji, Shengzhong Feng, Pingchuang Jiang:
Energy-aware hierarchical scheduling of applications in large scale data centers. CDC 2011: 158-165 - [c57]Rui Wang, Erlin Yao, Mingyu Chen, Guangming Tan, Pavan Balaji, Darius Buntinas:
Building algorithmically nonstop fault tolerant MPI programs. HiPC 2011: 1-9 - [c56]Ryan E. Grant, Mohammad J. Rashti, Ahmad Afsahi, Pavan Balaji:
RDMA Capable iWARP over Datagrams. IPDPS 2011: 628-639 - [c55]Abhinav Vishnu, Manojkumar Krishnan, Pavan Balaji:
Dynamic Time-Variant Connection Management for PGAS Models on InfiniBand. IPDPS Workshops 2011: 740-746 - [c54]Mohammad J. Rashti, Jonathan Green, Pavan Balaji, Ahmad Afsahi, William Gropp:
Multi-core and Network Aware MPI Topology Functions. EuroMPI 2011: 50-60 - [c53]James Dinan, Sriram Krishnamoorthy, Pavan Balaji, Jeff R. Hammond, Manojkumar Krishnan, Vinod Tipparaju, Abhinav Vishnu:
Noncollective Communicator Creation in MPI. EuroMPI 2011: 282-291 - [c52]James Dinan, Pavan Balaji, Jeff R. Hammond, Sriram Krishnamoorthy, Vinod Tipparaju:
Poster: High-level, one-sided programming models on MPI: a case study with global arrays and NWChem. SC Companion 2011: 37-38 - 2010
- [j9]Pavan Balaji, Wu-chun Feng, Heshan Lin, Jeremy S. Archuleta, Satoshi Matsuoka, Andrew S. Warren, João Carlos Setubal, Ewing L. Lusk, Rajeev Thakur, Ian T. Foster, Daniel S. Katz, Shantenu Jha, K. Shinpaugh, Susan Coghlan, Daniel A. Reed:
Global-scale distributed I/O with ParaMEDIC. Concurr. Comput. Pract. Exp. 22(16): 2266-2281 (2010) - [j8]Pavan Balaji, Anthony Chan, William Gropp, Rajeev Thakur, Ewing L. Lusk:
The Importance of Non-Data-Communication Overheads in MPI. Int. J. High Perform. Comput. Appl. 24(1): 5-15 (2010) - [j7]Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Rajeev Thakur:
Fine-Grained Multithreading Support for Hybrid Threaded MPI Programming. Int. J. High Perform. Comput. Appl. 24(1): 49-57 (2010) - [j6]Jesper Larsson Träff, Andreas Ripke, Christian Siebert, Pavan Balaji, Rajeev Thakur, William Gropp:
A Pipelined Algorithm for Large, Irregular All-Gather Problems. Int. J. High Perform. Comput. Appl. 24(1): 58-68 (2010) - [c51]James Dinan, Pavan Balaji, Ewing L. Lusk, P. Sadayappan, Rajeev Thakur:
Hybrid parallel programming with MPI and unified parallel C. Conf. Computing Frontiers 2010: 177-186 - [c50]David Goodell, Pavan Balaji, Darius Buntinas, Gábor Dózsa, William Gropp, Sameer Kumar, Bronis R. de Supinski, Rajeev Thakur:
Minimizing MPI Resource Contention in Multithreaded Multicore Environments. CLUSTER 2010: 1-8 - [c49]Yang Jiao, Heshan Lin, Pavan Balaji, Wu-chun Feng:
Power and Performance Characterization of Computational Kernels on the GPU. GreenCom/CPSCom 2010: 221-228 - [c48]Abhinav Vishnu, Shuaiwen Song, Andres Marquez, Kevin J. Barker, Darren J. Kerbyson, Kirk W. Cameron, Pavan Balaji:
Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models. GreenCom/CPSCom 2010: 229-236 - [c47]Mohammad J. Rashti, Ryan E. Grant, Ahmad Afsahi, Pavan Balaji:
iWARP redefined: Scalable connectionless communication over high-speed Ethernet. HiPC 2010: 1-10 - [c46]Abhinav Vishnu, Huub J. J. Van Dam, Wibe de Jong, Pavan Balaji, Shuaiwen Song:
Fault-tolerant communication runtime support for data-centric programming models. HiPC 2010: 1-9 - [c45]Dhabaleswar K. Panda, Sayantan Sur, Pavan Balaji:
Designing High-End Computing Systems with InfiniBand and High-Speed Ethernet. Hot Interconnects 2010: 125-127 - [c44]Ryan E. Grant, Pavan Balaji, Ahmad Afsahi:
A study of hardware assisted IP over InfiniBand and its impact on enterprise data center performance. ISPASS 2010: 144-153 - [c43]Gábor Dózsa, Sameer Kumar, Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Joe Ratterman, Rajeev Thakur:
Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems. EuroMPI 2010: 11-20 - [c42]Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Jayesh Krishna, Ewing L. Lusk, Rajeev Thakur:
PMI: A Scalable Parallel Process-Management Interface for Extreme-Scale Systems. EuroMPI 2010: 31-41 - [c41]Jayesh Krishna, Pavan Balaji, Ewing L. Lusk, Rajeev Thakur, Fabian Tiller:
Implementing MPI on Windows: Comparison with Common Approaches on Unix. EuroMPI 2010: 160-169 - [e1]Fabrizio Petrini, Dennis Abts, Ron Brightwell, Pavan Balaji, Cyriel Minkenberg:
IEEE 18th Annual Symposium on High Performance Interconnects, HOTI 2010, Google Campus, Mountain View, California, USA, August 18-20, 2010. IEEE Computer Society 2010, ISBN 978-1-4244-8547-5 [contents]
2000 – 2009
- 2009
- [j5]Wu-chun Feng, Pavan Balaji:
Tools and Environments for Multicore and Many-Core Architectures. Computer 42(11): 26-27 (2009) - [j4]Ping Lai, Pavan Balaji, Rajeev Thakur, Dhabaleswar K. Panda:
ProOnE: a general-purpose protocol onload engine for multi- and many-core architectures. Comput. Sci. Res. Dev. 23(3-4): 133-142 (2009) - [j3]Pavan Balaji, Anthony Chan, Rajeev Thakur, William Gropp, Ewing L. Lusk:
Toward message passing for a million processes: characterizing MPI on a massive scale Blue Gene/P. Comput. Sci. Res. Dev. 24(1-2): 11-19 (2009) - [c40]Gopalakrishnan Santhanaraman, Pavan Balaji, K. Gopalakrishnan, Rajeev Thakur, William Gropp, Dhabaleswar K. Panda:
Natively Supporting True One-Sided Communication in. CCGRID 2009: 380-387 - [c39]Dhabaleswar K. Panda, Matthew J. Koop, Pavan Balaji:
Tutorial: Infiniband and 10-Gigabit Ethernet for Dummies. Hot Interconnects 2009 - [c38]Dhabaleswar K. Panda, Matthew J. Koop, Pavan Balaji:
Tutorial: Designing High-End Computing Systems with Infiniband and 10-Gigabit Ethernet. Hot Interconnects 2009 - [c37]Ryan E. Grant, Ahmad Afsahi, Pavan Balaji:
Evaluation of ConnectX Virtual Protocol Interconnect for Data Centers. ICPADS 2009: 57-64 - [c36]Pavan Balaji, Harish Naik, Narayan Desai:
Understanding Network Saturation Behavior on Large-Scale Blue Gene/P Systems. ICPADS 2009: 586-593 - [c35]Ajeet Singh, Pavan Balaji, Wu-chun Feng:
GePSeA: A General-Purpose Software Acceleration Framework for Lightweight Task Offloading. ICPP 2009: 261-268 - [c34]Narayan Desai, Darius Buntinas, Daniel Buettner, Pavan Balaji, Anthony Chan:
Improving Resource Availability by Relaxing Network Allocation Constraints on Blue Gene/P. ICPP 2009: 333-339 - [c33]Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Sameer Kumar, Ewing L. Lusk, Rajeev Thakur, Jesper Larsson Träff:
MPI on a Million Processors. PVM/MPI 2009: 20-30 - 2008
- [c32]Narayan Desai, Pavan Balaji, P. Sadayappan, Mohammad Islam:
Are nonblocking networks really needed for high-end-computing workloads? CLUSTER 2008: 152-159 - [c31]Anthony Chan, Pavan Balaji, William Gropp, Rajeev Thakur:
Communication Analysis of Parallel 3D FFT for Flat Cartesian Meshes on Large Blue Gene Systems. HiPC 2008: 350-364 - [c30]Pavan Balaji, Sitha Bhagvat, Rajeev Thakur, Dhabaleswar K. Panda:
Sockets Direct Protocol for Hybrid Network Stacks: A Case Study with iWARP over 10G Ethernet. HiPC 2008: 478-490 - [c29]Mithilesh Kumar, Vineeta Chaube, Pavan Balaji, Wu-chun Feng, Hyun-Wook Jin:
Making a Case for Proactive Flow Control in Optical Circuit-Switched Networks. HiPC 2008: 491-502 - [c28]Pavan Balaji, Wu-chun Feng, Heshan Lin:
Semantic-based distributed I/O with the ParaMEDIC framework. HPDC 2008: 175-184 - [c27]Ganesh Narayanaswamy, Pavan Balaji, Wu-chun Feng:
Impact of Network Sharing in Multi-Core Architectures. ICCCN 2008: 249-254 - [c26]Pavan Balaji, Wu-chun Feng, Jeremy S. Archuleta, Heshan Lin, Rajkumar Kettimuthu, Rajeev Thakur, Xiaosong Ma:
Semantics-based distributed I/O for mpiBLAST. PPoPP 2008: 293-294 - [c25]Pavan Balaji, Anthony Chan, William Gropp, Rajeev Thakur, Ewing L. Lusk:
Non-data-communication Overheads in MPI: Analysis on Blue Gene/P. PVM/MPI 2008: 13-22 - [c24]Jesper Larsson Träff, Andreas Ripke, Christian Siebert, Pavan Balaji, Rajeev Thakur, William Gropp:
A Simple, Pipelined Algorithm for Large, Irregular All-gather Problems. PVM/MPI 2008: 84-93 - [c23]Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Rajeev Thakur:
Toward Efficient Support for Multithreaded MPI Communication. PVM/MPI 2008: 120-129 - [c22]Thomas Scogland, Pavan Balaji, Wu-chun Feng, Ganesh Narayanaswamy:
Asymmetric interactions in symmetric multi-core systems: analysis, enhancements and evaluation. SC 2008: 17 - [c21]Heshan Lin, Pavan Balaji, Ruth Poole, Carlos P. Sosa, Xiaosong Ma, Wu-chun Feng:
Massively parallel genomic sequence search on the Blue Gene/P architecture. SC 2008: 33 - 2007
- [c20]Dhabaleswar K. Panda, Pavan Balaji:
Designing high-end computing systems with InfiniBand and 10-Gigabit Ethernet iWARP. CLUSTER 2007 - [c19]Ganesh Narayanaswamy, Pavan Balaji, Wu-chun Feng:
An Analysis of 10-Gigabit Ethernet Protocol Stacks in Multicore Environments. Hot Interconnects 2007: 109-116 - [c18]Mohammad Islam, Pavan Balaji, Gerald Sabin, P. Sadayappan:
Analyzing and Minimizing the Impact of Opportunity Cost in QoS-aware Job Scheduling. ICPP 2007: 42 - [c17]Pavan Balaji, Sitha Bhagvat, Dhabaleswar K. Panda, Rajeev Thakur, William Gropp:
Advanced Flow-control Mechanisms for the Sockets Direct Protocol over InfiniBand. ICPP 2007: 73 - [c16]Pavan Balaji, Darius Buntinas, Satish Balay, Barry F. Smith, Rajeev Thakur, William Gropp:
Nonuniformly Communicating Noncontiguous Data: A Case Study with PETSc and MPI. IPDPS 2007: 1-10 - [c15]Karthikeyan Vaidyanathan, Sundeep Narravula, Pavan Balaji, Dhabaleswar K. Panda:
Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers. IPDPS 2007: 1-6 - [c14]Pavan Balaji, Wu-chun Feng, Sitha Bhagvat, Dhabaleswar K. Panda, Rajeev Thakur, William Gropp:
Analyzing the impact of supporting out-of-order communication on in-order performance with iWARP. SC 2007: 35 - 2006
- [j2]Pavan Balaji, Wu-chun Feng, Dhabaleswar K. Panda:
Bridging the Ethernet-Ethernot Performance Gap. IEEE Micro 26(3): 24-40 (2006) - [c13]Pavan Balaji, Sitha Bhagvat, Hyun-Wook Jin, Dhabaleswar K. Panda:
Asynchronous zero-copy communication for synchronous sockets in the sockets direct protocol (SDP) over InfiniBand. IPDPS 2006 - [c12]Pavan Balaji, Karthikeyan Vaidyanathan, Sundeep Narravula, Hyun-Wook Jin, Dhabaleswar K. Panda:
Designing next generation data-centers with advanced communication protocols and systems services. IPDPS 2006 - 2005
- [j1]Hyun-Wook Jin, Pavan Balaji, Chuck Yoo, Jin-Young Choi, Dhabaleswar K. Panda:
Exploiting NIC architectural support for enhancing IP-based protocols on high-performance networks. J. Parallel Distributed Comput. 65(11): 1348-1365 (2005) - [c11]Sundeep Narravula, Pavan Balaji, Karthikeyan Vaidyanathan, Hyun-Wook Jin, Dhabaleswar K. Panda:
Architecture for caching responses with multiple dynamic dependencies in multi-tier data-centers over InfiniBand. CCGRID 2005: 374-381 - [c10]Pavan Balaji, Wu-chun Feng, Qi Gao, Ranjit Noronha, Weikuan Yu, Dhabaleswar K. Panda:
Head-to-TOE Evaluation of High-Performance Sockets over Protocol Offload Engines. CLUSTER 2005: 1-10 - [c9]Pavan Balaji, Hyun-Wook Jin, Karthikeyan Vaidyanathan, Dhabaleswar K. Panda:
Supporting iWARP Compatibility and Features for Regular Network Adapters. CLUSTER 2005: 1-10 - [c8]Wu-chun Feng, Pavan Balaji, Christopher Baron, Laxmi N. Bhuyan, Dhabaleswar K. Panda:
Performance Characterization of a 10-Gigabit Ethernet TOE. Hot Interconnects 2005: 58-63 - [c7]Pavan Balaji, Sundeep Narravula, Karthikeyan Vaidyanathan, Hyun-Wook Jin, Dhabaleswar K. Panda:
On the provision of prioritization and soft QoS in dynamically reconfigurable shared data-centers over InfiniBand. ISPASS 2005: 280-289 - 2004
- [c6]Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda:
Towards provision of quality of service guarantees in job scheduling. CLUSTER 2004: 245-254 - [c5]Pavan Balaji, Sundeep Narravula, Karthikeyan Vaidyanathan, Savitha Krishnamoorthy, Jiesheng Wu, Dhabaleswar K. Panda:
Sockets Direct Protocol over InfiniBand in clusters: is it beneficial? ISPASS 2004: 28-35 - 2003
- [c4]Pavan Balaji, Jiesheng Wu, Tahsin M. Kurç, Ümit V. Çatalyürek, Dhabaleswar K. Panda, Joel H. Saltz:
Impact of High Performance Sockets on Data Intensive Applications. HPDC 2003: 24-33 - [c3]Rinku Gupta, Pavan Balaji, Dhabaleswar K. Panda, Jarek Nieplocha:
Efficient Collective Operations Using Remote Memory Operations on VIA-Based Clusters. IPDPS 2003: 46 - [c2]Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda:
QoPS: A QoS Based Scheme for Parallel Job Scheduling. JSSPP 2003: 252-268 - 2002
- [c1]Pavan Balaji, Piyush Shivam, Pete Wyckoff, Dhabaleswar K. Panda:
High Performance User Level Sockets over Gigabit Ethernet. CLUSTER 2002: 179-186
last updated on 2024-08-14 23:15 CEST by the dblp team
all metadata released as open data under CC0 1.0 license