default search action
John M. Mellor-Crummey
Person information
- affiliation: Rice University, Houston, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j31]Laksono Adhianto, Jonathon M. Anderson, Robert Matthew Barnett, Dragana Grbic, Vladimir Indic, Mark Krentel, Yumeng Liu, Srdan Milakovic, Wileam Phan, John M. Mellor-Crummey:
Refining HPCToolkit for application performance analysis at exascale. Int. J. High Perform. Comput. Appl. 38(6): 612-632 (2024) - [c106]Dejan Grubisic, Volker Seeker, Gabriel Synnaeve, Hugh Leather, John M. Mellor-Crummey, Chris Cummins:
Priority Sampling of Large Language Models for Compilers. EuroMLSys@EuroSys 2024: 91-97 - [c105]Ryuichi Sai, John M. Mellor-Crummey, Jinfan Xu, Mauricio Araya-Polo:
Automated Code Generation of High-Order Stencils for a Dataflow Architecture. SC 2024: 19 - [c104]Ryuichi Sai, François P. Hamon, John M. Mellor-Crummey, Mauricio Araya-Polo:
Matrix-Free Finite Volume Kernels on a Dataflow Architecture. SC 2024: 28 - [i10]Ryuichi Sai, François P. Hamon, John M. Mellor-Crummey, Mauricio Araya-Polo:
Matrix-Free Finite Volume Kernels on a Dataflow Architecture. CoRR abs/2408.03452 (2024) - 2023
- [i9]Dejan Grubisic, Bram Wasti, Chris Cummins, John M. Mellor-Crummey, Aleksandar Zlateski:
LoopTune: Optimizing Tensor Computations with Reinforcement Learning. CoRR abs/2309.01825 (2023) - [i8]Ryuichi Sai, Jinfan Xu, Mauricio Araya-Polo, John M. Mellor-Crummey:
Towards Accelerating High-Order Stencils on Modern GPUs and Emerging Architectures with a Portable Framework. CoRR abs/2309.04671 (2023) - 2022
- [j30]Ryuichi Sai, John M. Mellor-Crummey, Xiaozhu Meng, Keren Zhou, Mauricio Araya-Polo, Jie Meng:
Accelerating high-order stencils on GPUs. Concurr. Comput. Pract. Exp. 34(20) (2022) - [j29]Keren Zhou, Xiaozhu Meng, Ryuichi Sai, Dejan Grubisic, John M. Mellor-Crummey:
An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications. IEEE Trans. Parallel Distributed Syst. 33(4): 854-865 (2022) - [c103]Keren Zhou, Yueming Hao, John M. Mellor-Crummey, Xiaozhu Meng, Xu Liu:
ValueExpert: exploring value patterns in GPU-accelerated applications. ASPLOS 2022: 171-185 - [c102]Keren Zhou, Jonathon M. Anderson, Xiaozhu Meng, John M. Mellor-Crummey:
Low overhead and context sensitive profiling of CPU-accelerated applications. ICS 2022: 1:1-1:13 - [c101]Jonathon M. Anderson, Yumeng Liu, John M. Mellor-Crummey:
Preparing for performance analysis at exascale. ICS 2022: 34:1-34:13 - [c100]Vladimir Indic, John M. Mellor-Crummey:
Improving Tool Support for Nested Parallel Regions with Introspection Consistency. IWOMP 2022: 111-126 - 2021
- [j28]Keren Zhou, Laksono Adhianto, Jonathon M. Anderson, Aaron Cherian, Dejan Grubisic, Mark Krentel, Yumeng Liu, Xiaozhu Meng, John M. Mellor-Crummey:
Measurement and analysis of GPU-accelerated applications with HPCToolkit. Parallel Comput. 108: 102837 (2021) - [c99]Keren Zhou, Xiaozhu Meng, Ryuichi Sai, John M. Mellor-Crummey:
GPA: A GPU Performance Advisor Based on Instruction Sampling. CGO 2021: 115-125 - [c98]Ryuichi Sai, John M. Mellor-Crummey, Xiaozhu Meng, Mauricio Araya-Polo, Jie Meng:
Using the Semi-Stencil Algorithm to Accelerate High-Order Stencils on GPUs. PMBS 2021: 63-68 - [c97]Xiaozhu Meng, Jonathon M. Anderson, John M. Mellor-Crummey, Mark W. Krentel, Barton P. Miller, Srdan Milakovic:
Parallel binary code analysis. PPoPP 2021: 76-89 - [c96]Aaron Cherian, Keren Zhou, Dejan Grubisic, Xiaozhu Meng, John M. Mellor-Crummey:
Measurement and Analysis of GPU-Accelerated OpenCL Computations on Intel GPUs. ProTools@SC 2021: 26-35 - [i7]Jonathon M. Anderson, Yumeng Liu, John M. Mellor-Crummey:
Preparing for Performance Analysis at Exascale. CoRR abs/2108.04002 (2021) - [i6]Keren Zhou, Laksono Adhianto, Jonathon M. Anderson, Aaron Cherian, Dejan Grubisic, Mark Krentel, Yumeng Liu, Xiaozhu Meng, John M. Mellor-Crummey:
Measurement and Analysis of GPU-accelerated Applications with HPCToolkit. CoRR abs/2109.06931 (2021) - 2020
- [c95]Keren Zhou, Mark W. Krentel, John M. Mellor-Crummey:
Tools for top-down performance analysis of GPU-accelerated applications. ICS 2020: 26:1-26:12 - [c94]Ryuichi Sai, John M. Mellor-Crummey, Xiaozhu Meng, Mauricio Araya-Polo, Jie Meng:
Accelerating High-Order Stencils on GPUs. PMBS@SC 2020: 86-108 - [c93]Lai Wei, John M. Mellor-Crummey:
Using sample-based time series data for automated diagnosis of scalability losses in parallel programs. PPoPP 2020: 144-159 - [c92]Keren Zhou, Mark Krentel, John M. Mellor-Crummey:
A tool for top-down performance analysis of GPU-accelerated applications. PPoPP 2020: 415-416 - [c91]Keren Zhou, Yueming Hao, John M. Mellor-Crummey, Xiaozhu Meng, Xu Liu:
GVProf: a value profiler for GPU-based clusters. SC 2020: 89 - [i5]Xiaozhu Meng, Jonathon M. Anderson, John M. Mellor-Crummey, Mark W. Krentel, Barton P. Miller, Srdan Milakovic:
Parallelizing Binary Code Analysis. CoRR abs/2001.10621 (2020) - [i4]Keren Zhou, Xiaozhu Meng, Ryuichi Sai, John M. Mellor-Crummey:
GPA: A GPU Performance Advisor Based on Instruction Sampling. CoRR abs/2009.04061 (2020) - [i3]Ryuichi Sai, John M. Mellor-Crummey, Xiaozhu Meng, Mauricio Araya-Polo, Jie Meng:
Accelerating High-Order Stencils on GPUs. CoRR abs/2009.04619 (2020)
2010 – 2019
- 2019
- [c90]Keren Zhou, John M. Mellor-Crummey:
A Tool for Performance Analysis of GPU-Accelerated Applications. CGO 2019: 282 - [c89]Philip Taffet, John M. Mellor-Crummey:
Lightweight, Packet-Centric Monitoring of Network Traffic and Congestion Implemented in P4. Hot Interconnects 2019: 54-58 - [c88]Philip Taffet, John M. Mellor-Crummey:
Understanding congestion in high performance interconnection networks using sampling. SC 2019: 43:1-43:24 - 2018
- [c87]Lai Wei, John M. Mellor-Crummey:
Automated Analysis of Time Series Data to Understand Parallel Program Behaviors. ICS 2018: 240-251 - [c86]Yizi Gu, John M. Mellor-Crummey:
Dynamic data race detection for OpenMP programs. SC 2018: 61:1-61:12 - 2016
- [j27]Ashwin M. Aji, Lokendra S. Panwar, Feng Ji, Karthik Murthy, Milind Chabbi, Pavan Balaji, Keith R. Bisset, James Dinan, Wu-chun Feng, John M. Mellor-Crummey, Xiaosong Ma, Rajeev Thakur:
MPI-ACC: Accelerator-Aware MPI for Scientific Applications. IEEE Trans. Parallel Distributed Syst. 27(5): 1401-1414 (2016) - [c85]Karthik Murthy, Sri Raj Paul, Kuldeep S. Meel, Tiago Cogumbreiro, John M. Mellor-Crummey:
Design and Verification of Distributed Phasers. Euro-Par 2016: 405-418 - [c84]Sri Raj Paul, Mauricio Araya-Polo, John M. Mellor-Crummey, Detlef Hohl:
Performance Analysis and Optimization of a Hybrid Seismic Imaging Application. ICCS 2016: 8-18 - [c83]Chaoran Yang, John M. Mellor-Crummey:
A wait-free queue as fast as fetch-and-add. PPoPP 2016: 16:1-16:13 - [c82]Milind Chabbi, John M. Mellor-Crummey:
Contention-conscious, locality-preserving locks. PPoPP 2016: 22:1-22:14 - [c81]Chaoran Yang, John M. Mellor-Crummey:
A Practical Solution to the Cactus Stack Problem. SPAA 2016: 61-70 - [i2]Sri Raj Paul, John M. Mellor-Crummey, Mauricio Araya-Polo, Detlef Hohl:
Performance Analysis and Optimization of a Hybrid Distributed Reverse Time Migration Application. CoRR abs/1603.03971 (2016) - 2015
- [c80]Karthik Murthy, John M. Mellor-Crummey:
Communication Avoiding Algorithms: Analysis and Code Generation for Parallel Systems. PACT 2015: 150-162 - [c79]Milind Chabbi, Wim Lavrijsen, Wibe de Jong, Koushik Sen, John M. Mellor-Crummey, Costin Iancu:
Barrier elision for production parallel programs. PPoPP 2015: 109-119 - [c78]Milind Chabbi, Michael W. Fagan, John M. Mellor-Crummey:
High performance locks for multi-level NUMA systems. PPoPP 2015: 215-226 - [i1]Sri Raj Paul, Karthik Murthy, Kuldeep S. Meel, John M. Mellor-Crummey:
Distributed Phasers. CoRR abs/1512.07305 (2015) - 2014
- [c77]Xu Liu, Kamal Sharma, John M. Mellor-Crummey:
ArrayTool: a lightweight profiler to guide array regrouping. PACT 2014: 405-416 - [c76]Milind Chabbi, Xu Liu, John M. Mellor-Crummey:
Call Paths for Pin Tools. CGO 2014: 76 - [c75]John M. Mellor-Crummey, Seema Hiranandani, Ajay Sethi:
Author retrospective: compilation techniques for block-cyclic distributions. ICS 25th Anniversary 2014: 29-31 - [c74]Lai Wei, John M. Mellor-Crummey:
Autotuning Tensor Transposition. IPDPS Workshops 2014: 342-351 - [c73]Rishi Surendran, Raghavan Raman, Swarat Chaudhuri, John M. Mellor-Crummey, Vivek Sarkar:
Test-driven repair of data races in structured parallel programs. PLDI 2014: 15-25 - [c72]Chaoran Yang, Wesley Bland, John M. Mellor-Crummey, Pavan Balaji:
Portable, MPI-interoperable coarray fortran. PPoPP 2014: 81-92 - [c71]Xu Liu, John M. Mellor-Crummey:
A tool to analyze the performance of multithreaded programs on NUMA architectures. PPoPP 2014: 259-272 - 2013
- [c70]Ashwin M. Aji, Lokendra S. Panwar, Feng Ji, Milind Chabbi, Karthik Murthy, Pavan Balaji, Keith R. Bisset, James Dinan, Wu-chun Feng, John M. Mellor-Crummey, Xiaosong Ma, Rajeev Thakur:
On the efficacy of GPU-integrated MPI for scientific applications. HPDC 2013: 191-202 - [c69]Xu Liu, John M. Mellor-Crummey, Michael W. Fagan:
A new approach for performance analysis of openMP programs. ICS 2013: 69-80 - [c68]Chaoran Yang, Karthik Murthy, John M. Mellor-Crummey:
Managing Asynchronous Operations in Coarray Fortran 2.0. IPDPS 2013: 1321-1332 - [c67]Xu Liu, John M. Mellor-Crummey:
Pinpointing data locality bottlenecks with low overhead. ISPASS 2013: 183-193 - [c66]Alexandre E. Eichenberger, John M. Mellor-Crummey, Martin Schulz, Michael Wong, Nawal Copty, Robert Dietrich, Xu Liu, Eugene Loh, Daniel Lorenz:
OMPT: An OpenMP Tools Application Programming Interface for Performance Analysis. IWOMP 2013: 171-185 - [c65]Xu Liu, John M. Mellor-Crummey:
A data-centric profiler for parallel programs. SC 2013: 28:1-28:12 - [c64]Milind Chabbi, Karthik Murthy, Michael W. Fagan, John M. Mellor-Crummey:
Effective sampling-driven performance tools for GPU-accelerated supercomputers. SC 2013: 43:1-43:12 - 2012
- [c63]Milind Chabbi, John M. Mellor-Crummey:
DeadSpy: a tool to pinpoint program inefficiencies. CGO 2012: 124-134 - 2011
- [c62]Xu Liu, John M. Mellor-Crummey:
Pinpointing data locality problems using data-centric analysis. CGO 2011: 171-180 - [c61]Nathan R. Tallent, John M. Mellor-Crummey, Michael Franco, Reed Landrum, Laksono Adhianto:
Scalable fine-grained call path tracing. ICS 2011: 63-74 - [c60]Guohua Jin, John M. Mellor-Crummey, Laksono Adhianto, William N. Scherer III, Chaoran Yang:
Implementation and Performance Evaluation of the HPC Challenge Benchmarks in Coarray Fortran 2.0. IPDPS 2011: 1089-1100 - [c59]John M. Mellor-Crummey:
HIPS Keynote. IPDPS Workshops 2011: 1141 - [c58]Nathan R. Tallent, John M. Mellor-Crummey:
Using Sampling to Understand Parallel Program Performance. Parallel Tools Workshop 2011: 13-25 - [e2]Keith D. Cooper, John M. Mellor-Crummey, Vivek Sarkar:
Languages and Compilers for Parallel Computing - 23rd International Workshop, LCPC 2010, Houston, TX, USA, October 7-9, 2010. Revised Selected Papers. Lecture Notes in Computer Science 6548, Springer 2011, ISBN 978-3-642-19594-5 [contents] - 2010
- [j26]Laksono Adhianto, S. Banerjee, Michael W. Fagan, Mark Krentel, Gabriel Marin, John M. Mellor-Crummey, Nathan R. Tallent:
HPCTOOLKIT: tools for performance analysis of optimized parallel programs. Concurr. Comput. Pract. Exp. 22(6): 685-701 (2010) - [j25]John M. Mellor-Crummey, William Gropp, Maurice Herlihy:
Teaching parallel programming: a roundtable discussion. XRDS 17(1): 28-30 (2010) - [c57]Laksono Adhianto, John M. Mellor-Crummey, Nathan R. Tallent:
Effectively Presenting Call Path Profiles of Application Performance. ICPP Workshops 2010: 179-188 - [c56]William N. Scherer III, Laksono Adhianto, Guohua Jin, John M. Mellor-Crummey, Chaoran Yang:
Hiding latency in Coarray Fortran 2.0. PGAS 2010: 14 - [c55]Nathan R. Tallent, John M. Mellor-Crummey, Allan Porterfield:
Analyzing lock contention in multithreaded applications. PPoPP 2010: 269-280 - [c54]Nathan R. Tallent, Laksono Adhianto, John M. Mellor-Crummey:
Scalable Identification of Load Imbalance in Parallel Executions Using Call Path Profiles. SC 2010: 1-11
2000 – 2009
- 2009
- [j24]Nathan R. Tallent, John M. Mellor-Crummey:
Identifying Performance Bottlenecks in Work-Stealing Computations. Computer 42(11): 44-50 (2009) - [c53]Nathan R. Tallent, John M. Mellor-Crummey, Michael W. Fagan:
Binary analysis for measurement and attribution of program performance. PLDI 2009: 441-452 - [c52]Nathan R. Tallent, John M. Mellor-Crummey:
Effective performance measurement and analysis of multithreaded applications. PPoPP 2009: 229-240 - [c51]Nathan R. Tallent, John M. Mellor-Crummey, Laksono Adhianto, Michael W. Fagan, Mark Krentel:
Diagnosing performance bottlenecks in emerging petascale applications. SC 2009 - 2008
- [c50]Gabriel Marin, John M. Mellor-Crummey:
Pinpointing and Exploiting Opportunities for Enhancing Data Reuse. ISPASS 2008: 115-126 - [c49]John M. Mellor-Crummey:
Where will all the threads come from? PPoPP 2008: 259-260 - 2007
- [c48]Cristian Coarfa, John M. Mellor-Crummey, Nathan Froyd, Yuri Dotsenko:
Scalability analysis of SPMD codes using expectations. ICS 2007: 13-22 - [c47]Gabriel Marin, John M. Mellor-Crummey:
Application Insight Through Performance Modeling. IPCCC 2007: 65-74 - [e1]Katherine A. Yelick, John M. Mellor-Crummey:
Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2007, San Jose, California, USA, March 14-17, 2007. ACM 2007, ISBN 978-1-59593-602-8 [contents] - 2006
- [j23]Yuri Dotsenko, Cristian Coarfa, Luay Nakhleh, John M. Mellor-Crummey, Usman Roshan:
PRec-I-DCM3: a parallel framework for fast and accurate large-scale phylogeny reconstruction. Int. J. Bioinform. Res. Appl. 2(4): 407-419 (2006) - [j22]Cristian Coarfa, Yuri Dotsenko, John M. Mellor-Crummey:
Experiences with Sweep3D implementations in Co-array Fortran. J. Supercomput. 36(2): 101-121 (2006) - [j21]Apan Qasem, Ken Kennedy, John M. Mellor-Crummey:
Automatic tuning of whole applications using direct search and a performance-based transformation system. J. Supercomput. 36(2): 183-196 (2006) - 2005
- [j20]Guohua Jin, John M. Mellor-Crummey:
Improving Performance by Reducing the Memory Footprint of Scientific Applications. Int. J. High Perform. Comput. Appl. 19(4): 433-451 (2005) - [j19]Francine Berman, Henri Casanova, Andrew A. Chien, Keith D. Cooper, Holly Dail, Anshuman Dasgupta, W. Deng, Jack J. Dongarra, S. Lennart Johnsson, Ken Kennedy, Charles Koelbel, Bo Liu, Xin Liu, Anirban Mandal, Gabriel Marin, Mark Mazina, John M. Mellor-Crummey, Celso L. Mendes, Alex Olugbile, M. Patel, Daniel A. Reed, Zhiao Shi, Otto Sievert, Huaxia Xia, Asim YarKhan:
New Grid Scheduling and Rescheduling Methods in the GrADS Project. Int. J. Parallel Program. 33(2-3): 209-229 (2005) - [j18]Ken Kennedy, Bradley Broom, Arun Chauhan, Robert J. Fowler, John Garvin, Charles Koelbel, Cheryl McCosh, John M. Mellor-Crummey:
Telescoping Languages: A System for Automatic Generation of Domain Languages. Proc. IEEE 93(2): 387-408 (2005) - [j17]Guohua Jin, John M. Mellor-Crummey:
SFCGen: A framework for efficient generation of multi-dimensional space-filling curves by recursion. ACM Trans. Math. Softw. 31(1): 120-148 (2005) - [c46]Guohua Jin, John M. Mellor-Crummey:
Space-filling Curve Generation: A Table-based Approach. AMCS 2005: 40-46 - [c45]Luay Nakhleh, Guohua Jin, Fengmei Zhao, John M. Mellor-Crummey:
Reconstructing Phylogenetic Networks Using Maximum Parsimony. CSB 2005: 93-102 - [c44]Anirban Mandal, Ken Kennedy, Charles Koelbel, Gabriel Marin, John M. Mellor-Crummey, Bo Liu, S. Lennart Johnsson:
Scheduling strategies for mapping application workflows onto the grid. HPDC 2005: 125-134 - [c43]Cristian Coarfa, Yuri Dotsenko, John M. Mellor-Crummey, Luay Nakhleh, Usman Roshan:
PRec-I-DCM3: A Parallel Framework for Fast and Accurate Large Scale Phylogeny Reconstruction. ICPADS (2) 2005: 346-350 - [c42]Nathan Froyd, John M. Mellor-Crummey, Robert J. Fowler:
Low-overhead call path profiling of unmodified, optimized code. ICS 2005: 81-90 - [c41]Daniel G. Chavarría-Miranda, Guohua Jin, John M. Mellor-Crummey:
COTS Clusters vs. the Earth Simulator: An Application Study Using IMPACT-3D. IPDPS 2005 - [c40]Michelle Mills Strout, John M. Mellor-Crummey, Paul D. Hovland:
Representation-independent program analysis. PASTE 2005: 67-74 - [c39]Daniel G. Chavarría-Miranda, John M. Mellor-Crummey:
Effective communication coalescing for data-parallel applications. PPoPP 2005: 14-25 - [c38]Cristian Coarfa, Yuri Dotsenko, John M. Mellor-Crummey, François Cantonnet, Tarek A. El-Ghazawi, Ashrujit Mohanti, Yiyi Yao, Daniel G. Chavarría-Miranda:
An evaluation of global address space languages: co-array fortran and unified parallel C. PPoPP 2005: 36-47 - 2004
- [j16]John M. Mellor-Crummey, John Garvin:
Optimizing Sparse Matrix - Vector Product Computations Using Unroll and Jam. Int. J. High Perform. Comput. Appl. 18(2): 225-236 (2004) - [c37]Yuri Dotsenko, Cristian Coarfa, John M. Mellor-Crummey:
A Multi-Platform Co-Array Fortran Compiler. IEEE PACT 2004: 29-40 - [c36]Anirban Mandal, Anshuman Dasgupta, Ken Kennedy, Mark Mazina, Charles Koelbel, Gabriel Marin, Keith D. Cooper, John M. Mellor-Crummey, Bo Liu, S. Lennart Johnsson:
Scheduling workflow applications in GrADS. CCGRID 2004: 790-797 - [c35]Keith D. Cooper, Anshuman Dasgupta, Ken Kennedy, Charles Koelbel, Anirban Mandal, Gabriel Marin, Mark Mazina, John M. Mellor-Crummey, Francine Berman, Henri Casanova, Andrew A. Chien, Holly Dail, Xin Liu, Alex Olugbile, Otto Sievert, Huaxia Xia, S. Lennart Johnsson, Bo Liu, M. Patel, Daniel A. Reed, W. Deng, Celso L. Mendes, Zhiao Shi, Asim YarKhan, Jack J. Dongarra:
New Grid Scheduling and Rescheduling Methods in the GrADS Project. IPDPS Next Generation Software Program - NSFNGS - PI Workshop 2004 - [c34]Yuri Dotsenko, Cristian Coarfa, John M. Mellor-Crummey, Daniel G. Chavarría-Miranda:
Experiences with Co-array Fortran on Hardware Shared Memory Platforms. LCPC 2004: 332-347 - [c33]Gabriel Marin, John M. Mellor-Crummey:
Cross-architecture performance predictions for scientific applications using parameterized models. SIGMETRICS 2004: 2-13 - 2003
- [j15]Daniel G. Chavarría-Miranda, John M. Mellor-Crummey:
An Evaluation of Data-Parallel Compiler Support for Line-Sweep Applications. J. Instr. Level Parallelism 5 (2003) - [j14]Alain Darte, John M. Mellor-Crummey, Robert J. Fowler, Daniel G. Chavarría-Miranda:
Generalized multipartitioning of multi-dimensional arrays for parallelizing line-sweep computations. J. Parallel Distributed Comput. 63(9): 887-911 (2003) - [c32]Cristian Coarfa, Yuri Dotsenko, Jason Eckhardt, John M. Mellor-Crummey:
Co-array Fortran Performance and Potential: An NPB Experimental Study. LCPC 2003: 177-193 - 2002
- [j13]John M. Mellor-Crummey, Vikram S. Adve, Bradley Broom, Daniel G. Chavarría-Miranda, Robert J. Fowler, Guohua Jin, Ken Kennedy, Qing Yi:
Advanced optimization strategies in the Rice dHPF compiler. Concurr. Comput. Pract. Exp. 14(8-9): 741-767 (2002) - [j12]John M. Mellor-Crummey, Robert J. Fowler, Gabriel Marin, Nathan R. Tallent:
HPCVIEW: A Tool for Top-down Analysis of Node Performance. J. Supercomput. 23(1): 81-104 (2002) - [c31]Daniel G. Chavarría-Miranda, John M. Mellor-Crummey:
An Evaluation of Data-Parallel Compiler Support for Line-Sweep Applications. IEEE PACT 2002: 7-17 - [c30]Guohua Jin, John M. Mellor-Crummey:
Experiences tuning SMG98: a semicoarsening multigrid benchmark based on the hypre library. ICS 2002: 305-314 - [c29]Daniel G. Chavarría-Miranda, Alain Darte, Robert J. Fowler, John M. Mellor-Crummey:
Generalized Multipartitioning for Multi-Dimensional Arrays. IPDPS 2002 - [c28]Ken Kennedy, Mark Mazina, John M. Mellor-Crummey, Keith D. Cooper, Linda Torczon, Francine Berman, Andrew A. Chien, Holly Dail, Otto Sievert, Dave Angulo, Ian T. Foster, Ruth A. Aydt, Daniel A. Reed, Dennis Gannon, S. Lennart Johnsson, Carl Kesselman, Jack J. Dongarra, Sathish S. Vadhiyar, Richard Wolski:
Toward a Framework for Preparing and Executing Adaptive Grid Programs. IPDPS 2002 - 2001
- [j11]Francine Berman, Andrew A. Chien, Keith D. Cooper, Jack J. Dongarra, Ian T. Foster, Dennis Gannon, S. Lennart Johnsson, Ken Kennedy, Carl Kesselman, John M. Mellor-Crummey, Daniel A. Reed, Linda Torczon, Richard Wolski:
The GrADS Project: Software Support for High-Level Grid Application Development. Int. J. High Perform. Comput. Appl. 15(4): 327-344 (2001) - [j10]John M. Mellor-Crummey, David B. Whalley, Ken Kennedy:
Improving Memory Hierarchy Performance for Irregular Applications Using Data and Computation Reorderings. Int. J. Parallel Program. 29(3): 217-247 (2001) - [j9]Ken Kennedy, Bradley Broom, Keith D. Cooper, Jack J. Dongarra, Robert J. Fowler, Dennis Gannon, S. Lennart Johnsson, John M. Mellor-Crummey, Linda Torczon:
Telescoping Languages: A Strategy for Automatic Generation of Scientific Problem-Solving Systems from Annotated Libraries. J. Parallel Distributed Comput. 61(12): 1803-1826 (2001) - [c27]Vikram S. Adve, John M. Mellor-Crummey:
Advanced Code Generation for High Performance Fortran. Compiler Optimizations for Scalable Parallel Systems Languages 2001: 553-596 - [c26]Daniel G. Chavarría-Miranda, John M. Mellor-Crummey, Trushar Sarang:
Data-Parallel Compiler Support for Multipartitioning. Euro-Par 2001: 241-253 - [c25]John M. Mellor-Crummey, Robert J. Fowler, David B. Whalley:
Tools for application-oriented performance tuning. ICS 2001: 154-165 - [c24]Guohua Jin, John M. Mellor-Crummey, Robert J. Fowler:
Increasing temporal locality with skewing and recursive blocking. SC 2001: 43 - [c23]John M. Mellor-Crummey, Robert J. Fowler, David B. Whalley:
On providing useful information for analyzing and tuning applications. SIGMETRICS/Performance 2001: 332-333 - 2000
- [c22]Kai Zhang, John M. Mellor-Crummey, Robert J. Fowler:
Compilation and Runtime-Optimizations for Software Distributed Shared Memory. LCR 2000: 182-191 - [c21]Daniel G. Chavarría-Miranda, John M. Mellor-Crummey:
Toward Compiler Support for Scalable Parallelism Using Multipartitioning. LCR 2000: 272-284
1990 – 1999
- 1999
- [c20]John M. Mellor-Crummey, David B. Whalley, Ken Kennedy:
Improving memory hierarchy performance for irregular applications. International Conference on Supercomputing 1999: 425-433 - [c19]Collin McCurdy, John M. Mellor-Crummey:
An Evaluation of Computing Paradigms for N-Body Simulations on Distributed Memory Architectures. PPoPP 1999: 25-36 - 1998
- [j8]John M. Mellor-Crummey, Vikram S. Adve:
Simplifying Control Flow in Compiler-Generated Parallel Code. Int. J. Parallel Program. 26(5): 613-638 (1998) - [c18]Bo Lu, John M. Mellor-Crummey:
Compiler-Optimization of Implicit Reductions for Distributed Memory Multiprocessors. IPPS/SPDP 1998: 42-51 - [c17]Vikram S. Adve, John M. Mellor-Crummey:
Using Integer Sets for Data-Parallel Program Analysis and Optimization. PLDI 1998: 186-198 - [c16]Vikram S. Adve, Guohua Jin, John M. Mellor-Crummey, Qing Yi:
High Performance Fortran Compilation Techniques for Parallelizing Scientific Codes. SC 1998: 11 - 1997
- [c15]John M. Mellor-Crummey, Vikram S. Adve:
Simplifying Control Flow in Compiler-Generated Parallel Code. LCPC 1997: 235-239 - [c14]Gerald Roth, John M. Mellor-Crummey, Ken Kennedy, R. Gregg Brickner:
Compiling Stencils in High Performance Fortran. SC 1997: 12 - 1995
- [c13]Ken Kennedy, John M. Mellor-Crummey, Gerald Roth:
Optimizing Fortran 90 Shift Operations on Distributed-Memory Multicomputers. LCPC 1995: 161-175 - [c12]Vikram S. Adve, John M. Mellor-Crummey, Mark Anderson, Ken Kennedy, Jhy-Chun Wang, Daniel A. Reed:
An Integrated Compilation and Performance Analysis Environment for Data Parallel Programs. SC 1995: 50 - 1994
- [j7]Vikram S. Adve, Alan Carle, Elana D. Granston, Seema Hiranandani, Ken Kennedy, Charles Koelbel, Ulrich Kremer, John M. Mellor-Crummey, Scott K. Warren, Chau-Wen Tseng:
Requirements for DataParallel Programming Environments. IEEE Parallel Distributed Technol. Syst. Appl. 2(3): 48-58 (1994) - [j6]Michael L. Scott, John M. Mellor-Crummey:
Fast, contention-free combining tree barriers for shared-memory multiprocessors. Int. J. Parallel Program. 22(4): 449-481 (1994) - [c11]Seema Hiranandani, Ken Kennedy, John M. Mellor-Crummey, Ajay Sethi:
Compilation techniques for block-cyclic distributions. International Conference on Supercomputing 1994: 392-403 - [p1]Ulrich Kremer, John M. Mellor-Crummey, Ken Kennedy, Alan Carle:
Automatic Data Layout for Distributed-Memory Machines in the D Programming Environment. Automatic Parallelization 1994: 136-152 - 1993
- [j5]Keith D. Cooper, Mary W. Hall, Robert T. Hood, Ken Kennedy, Kathryn S. McKinley, John M. Mellor-Crummey, Linda Torczon, Scott K. Warren:
The ParaScope parallel programming environment. Proc. IEEE 81(2): 244-263 (1993) - [c10]Mary W. Hall, John M. Mellor-Crummey, Alan Carle, René G. Rodríguez:
FIAT: A Framework for Interprocedural Analysis and Transfomation. LCPC 1993: 522-545 - [c9]John M. Mellor-Crummey:
Compile-Time Support for Efficient Data Race Detection in Shared-Memory Parallel Programs. Workshop on Parallel and Distributed Debugging 1993: 129-139 - 1992
- [c8]Ervan Darnell, John M. Mellor-Crummey, Ken Kennedy:
Automatic software cache coherence through vectorization. ICS 1992: 129-138 - 1991
- [j4]John M. Mellor-Crummey, Michael L. Scott:
Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors. ACM Trans. Comput. Syst. 9(1): 21-65 (1991) - [c7]John M. Mellor-Crummey, Michael L. Scott:
Synchronization without Contention. ASPLOS 1991: 269-278 - [c6]John M. Mellor-Crummey, Michael L. Scott:
Scalable Reader-Writer Synchronization for Shared-Memory Multiprocessors. PPoPP 1991: 106-113 - [c5]John M. Mellor-Crummey:
On-the-fly detection of data races for programs with nested fork-join parallelism. SC 1991: 24-33 - 1990
- [j3]Thomas J. LeBlanc, John M. Mellor-Crummey, Robert J. Fowler:
Analyzing Parallel Program Executions Using Multiple Views. J. Parallel Distributed Comput. 9(2): 203-217 (1990) - [c4]Robert Hood, Ken Kennedy, John M. Mellor-Crummey:
Parallel program debugging with on-the-fly anomaly detection. SC 1990: 74-81
1980 – 1989
- 1989
- [j2]Thomas J. LeBlanc, John M. Mellor-Crummey, Neal M. Gafter, Lawrence A. Crowl, Peter C. Dibble:
The Elmwood Multiprocessor Operating System. Softw. Pract. Exp. 19(11): 1029-1055 (1989) - [c3]John M. Mellor-Crummey, Thomas J. LeBlanc:
A Software Instruction Counter. ASPLOS 1989: 78-86 - 1988
- [c2]John M. Mellor-Crummey:
Experience with the BBN Butterfly. COMPCON 1988: 101-105 - [c1]Robert J. Fowler, Thomas J. LeBlanc, John M. Mellor-Crummey:
An Integrated Approach to Parallel Program Debugging and Performance Analysis of Large-Scal Multiprocessors. Workshop on Parallel and Distributed Debugging 1988: 163-173 - 1987
- [j1]Thomas J. LeBlanc, John M. Mellor-Crummey:
Debugging Parallel Programs with Instant Replay. IEEE Trans. Computers 36(4): 471-482 (1987)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-22 19:56 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint