SAGE-HPCA: Vol 25, No 4

Volume 25, Issue 4November 2011

Volume 25, Issue 4

November 2011

Publisher:

Sage Publications, Inc.
2455 Teller Road Thousand Oaks, CA
United States

ISSN:1094-3420

Tags:

Betweenness centrality
CMP$im simulator
Grid computing
ILU factorization
Markov clustering

Get Alerts for this PeriodicalAlerts Save to BinderBinder Export CitationCitation

Share on

Reflects downloads up to 06 Feb 2025Bibliometrics

Citation Count

190

Downloads (6 weeks)

Downloads (12 months)

Downloads (cumulative)

Sections

Volume 25 , Issue 4

November 2011

PreviousIssue NextIssue

Skip Table Of Content Section

Select All

Export Citations Save to Binder

other

Special Issue on Programming Models, Software and Tools for High-End Computing

Balaji Pavan,
Vishnu Abhinav

Pages 353–354https://doi.org/10.1177/1094342011414549

- 0
Metrics
Total Citations0

other

Global-aware and multi-order context-based prefetching for high-performance processors

Yong Chen,
Huaiyu Zhu,
Philip C. Roth,
Hui Jin,
Xian-He Sun

Pages 355–370https://doi.org/10.1177/1094342010394386

Data prefetching is widely used in high-end computing systems to accelerate data accesses and to bridge the increasing performance gap between processor and memory. Context-based prefetching has become a primary focus of study in recent years due to its ...

- 1
Metrics
Total Citations1

Abstract

research-article

Periodic hierarchical load balancing for large supercomputers

Gengbin Zheng,
Abhinav Bhatelé,
Esteban Meneses,
Laxmikant V. Kalé

Pages 371–385https://doi.org/10.1177/1094342010394383

Large parallel machines with hundreds of thousands of processors are becoming more prevalent. Ensuring good load balance is critical for scaling certain classes of parallel applications on even thousands of processors. Centralized load balancing ...

- 21
Metrics
Total Citations21

Abstract

research-article

Sparse triangular solves for ILU revisited: data layout crucial to better performance

Barry Smith,
Hong Zhang

Pages 386–391https://doi.org/10.1177/1094342010389857

A key to good processor utilization for sparse matrix computations is storing the data in the format that is most conducive to fast access by the memory system. In particular, for sparse matrix triangular solves the traditional compressed sparse matrix ...

- 5
Metrics
Total Citations5

Abstract

research-article

A general method for modeling on irregular grids

Alexander E. Macdonald,
Jacques Middlecoff,
Tom Henderson,
Jin-Luen Lee

Pages 392–403https://doi.org/10.1177/1094342010385019

For simulation on a spherical surface, such as global numerical weather prediction, icosahedral grids are superior to their competitors in uniformity of grid mesh distance across the entire globe and lack of neighboring grid cells that share only a ...

- 3
Metrics
Total Citations3

Abstract

research-article

Color and texture analysis using emerging parallel architectures

Francisco D Igual,
Rafael Mayo,
Timothy Dr Hartley,
Ümit V Çatalyürek,
Antonio Ruiz,
Manuel Ujaldon

Pages 404–427https://doi.org/10.1177/1094342010390340

While image texture is effective for use in pattern-recognition and image-analysis algorithms, textural features are time-consuming to calculate on standard CPUs. Therefore, we present novel implementations of textural-feature algorithms on graphics ...

- 0
Metrics
Total Citations0

Abstract

research-article

Trace-based performance analysis for the petascale simulation code FLASH

Heike Jagode,
Andreas Knüpfer,
Jack Dongarra,
Matthias Jurenz,
Matthias S Müller,
Wolfgang E Nagel

Pages 428–439https://doi.org/10.1177/1094342010387806

Performance analysis of applications on modern high-end petascale systems is increasingly challenging due to the rising complexity and quantity of the computing units. This paper presents a performance-analysis study using the Vampir performance-...

- 0
Metrics
Total Citations0

Abstract

research-article

Fast iterative solution of large sparse linear systems on geographically separated clusters

Tp Collignon,
Mb Van Gijzen

Pages 440–450https://doi.org/10.1177/1094342010388541

Parallel asynchronous iterative algorithms exhibit features that are extremely well-suited for Grid computing, such as lack of synchronization points. Unfortunately, they also suffer from slow convergence rates. In this paper we propose using ...

- 0
Metrics
Total Citations0

Abstract

research-article

Measuring TeraGrid: workload characterization for a high-performance computing federation

David L Hart

Pages 451–465https://doi.org/10.1177/1094342010394382

TeraGrid has deployed a significant monitoring and accounting infrastructure in order to understand its operational success. In this paper, we present an analysis of the jobs reported by TeraGrid for 2008. We consider the workload from several ...

- 15
Metrics
Total Citations15

Abstract

other

Scalability studies and large grid computations for surface combatant using CFDShip-Iowa

Shanti Bhushan,
Pablo Carrica,
Jianming Yang,
Frederick Stern

Pages 466–487https://doi.org/10.1177/1094342010394887

Scalability studies and computations using the largest grids to date for free-surface flows are performed using message-passing interface (MPI)-based CFDShip-Iowa toolbox curvilinear (V4) and Cartesian (V6) grid solvers on Navy high-performance ...

- 0
Metrics
Total Citations0

Abstract

other

Parallel solution of the obstacle problem in Grid environments

M. Chau,
R. Couturier,
J. Bahi,
P. Spiteri

Pages 488–495https://doi.org/10.1177/1094342010395412

The present study deals with the solution of the obstacle problem defined in a three-dimensional domain. In order to solve a large-scale obstacle problem, the use of parallelism is necessary. In this work we present a parallel synchronous iterative ...

- 2
Metrics
Total Citations2

Abstract

other

The Combinatorial BLAS: design, implementation, and applications

Aydın Buluç,
John R Gilbert

Pages 496–509https://doi.org/10.1177/1094342011403516

This paper presents a scalable high-performance software library to be used for graph analysis and data mining. Large combinatorial graphs appear in many applications of high-performance computing, including computational biology, informatics, analytics,...

- 143
Metrics
Total Citations143

Abstract

Save to Binder

Create a New Binder

Name

Comments

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation