Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleJuly 2000
An efficient self-simulation algorithm for reconfigurable meshes
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 216–223https://doi.org/10.1145/341800.341825A reconfigurable mesh (RM) is the two-dimensional mesh-connected computer enhanced with a reconfigurable bus system. The bus system is used to dynamically obtain various interconnection patterns among the processors during the execution of programs. ...
- ArticleJuly 2000
Multithreaded algorithms for the fast Fourier transform
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 176–185https://doi.org/10.1145/341800.341821In this paper we present fine-grained multithreaded algorithms and implementations for the Fast Fourier Transform (FFT) problem. The FFT problem has been formulated using two distinct approaches based on the dataflow concepts. The first approach, ...
- ArticleJuly 2000
An experimental study of a simple, distributed edge coloring algorithm
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 166–175https://doi.org/10.1145/341800.341820We conduct an experimental analysis of a distributed, randomized algorithm for edge coloring simple undirected graphs. The algorithm is extremely simple, yet, according to the probabilistic analysis, it computes nearly optimal colorings very quickly [12]...
- ArticleJuly 2000
A no-busy-wait balanced tree parallel algorithmic paradigm
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 147–155https://doi.org/10.1145/341800.341818Suppose that a parallel algorithm can include any number of parallel threads. Each thread can proceed without ever having to busy wait to another thread. A thread can proceed till its termination, but no new threads can be formed. What kind of problems ...
-
- ArticleJuly 2000
DCAS-based concurrent deques
- Ole Agesen,
- David L. Detlefs,
- Christine H. Flood,
- Alexander T. Garthwaite,
- Paul A. Martin,
- Nir N. Shavit,
- Guy L. Steele
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 137–146https://doi.org/10.1145/341800.341817The computer industry is currently examining the use of strong synchronization operations such as double compare-and-swap (DCAS) as a means of supporting non-blocking synchronization on tomorrow's multiprocessor machines. However, before such a strong ...
- ArticleJuly 2000
Efficient, distributed data placement strategies for storage area networks (extended abstract)
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 119–128https://doi.org/10.1145/341800.341815In the last couple of years a dramatic growth of enterprise data storage capacity can be observed. As a result, new strategies have been sought that allow servers and storage being centralized to better manage the explosion of data and the overall cost ...
- ArticleJuly 2000
Data management in hierarchical bus networks
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 109–118https://doi.org/10.1145/341800.341814A hierarchical bus network T = (V, E) uses hierarchically, tree-like connected buses as a communication network. New communication technologies like SCI (Scalable Coherent Interface) (see, e.g., [6, 7]) make such networks very attractive, because they ...
- ArticleJuly 2000
Infinite parallel job allocation (extended abstract)
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 99–108https://doi.org/10.1145/341800.341813In recent years, the task of allocating jobs to servers has been studied with the “balls and bins” abstraction. Results in this area exploit the large decrease in maximum load that can be achieved by allowing each job (ball) a little freedom in choosing ...
- ArticleJuly 2000
Asynchronous scheduling of redundant disk arrays
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 89–98https://doi.org/10.1145/341800.341812Random redundant allocation of data to parallel disk arrays can be exploited to achieve low access delays. New algorithms are proposed which improve the previously known shortest queue algorithm by systematically exploiting that scheduling decisions can ...
- ArticleJuly 2000
Comparing the effectiveness of fine-grain memory caching against page migration/replication in reducing traffic in DSM clusters
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 79–88https://doi.org/10.1145/341800.341811In this paper, we compare and contrast two techniques to improve capacity/conflict miss traffic in CC-NUMA DSM clusters. Page migration/replication optimizes read-write accesses to a page used by a single processor by migrating the page to that ...
- ArticleJuly 2000
Generalized connection caching
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 70–78https://doi.org/10.1145/341800.341810Cohen et al. [5] recently initiated the theoretical study of connection caching in the world-wide web. They extensively studied uniform connection caching, where the establishment cost is uniform for all connections [5, 6]. They showed that ordinary ...
- ArticleJuly 2000
Fault tolerant networks with small degree
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 64–69https://doi.org/10.1145/341800.341809In this paper, we study the design of fault tolerant networks for arrays and meshes by adding redundant nodes and edges. For a target graph G (linear array or mesh in this paper), a graph G′ is called a κ-fault-tolerant graph of G if when we remove any ...
- ArticleJuly 2000
Connection caching under various models of communication
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 54–63https://doi.org/10.1145/341800.341808Motivated by Web applications, we recently introduced the following theoretical model for connection-caching: Each host on a network can maintain (cache) a limited number of connections to other hosts. A message can be transmitted from one host to ...
- ArticleJuly 2000
Efficient on-line communication in cellular networks
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 46–53https://doi.org/10.1145/341800.341807In this paper we consider communication issues arising in mobile networks that utilize Frequency Division Multiplexing (FDM) technology. In such networks, many users within the same geographical region can communicate simultaneously with other users of ...
- ArticleJuly 2000
Interprocessor communication with memory constraints
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 39–45https://doi.org/10.1145/341800.341806Many parallel applications require periodic redistribution of workloads and associated data. In a distributed memory computer, this redistribution can be difficult if limited memory is available for receiving messages. We propose a model for optimizing ...
- ArticleJuly 2000
Diffusive load balancing schemes on heterogeneous networks
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 30–38https://doi.org/10.1145/341800.341805Up to now, diffusive load balancing schemes have only been developed for homogeneous networks. We generalize existing diffusion schemes, in order to deal with heterogeneous networks. In these networks, every processor can have arbitrary computing power, ...
- ArticleJuly 2000
Optimal schedules for data-parallel cycle-stealing in networks of workstations (extended abstract)
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 22–29https://doi.org/10.1145/341800.341804We refine the model underlying our prior work on scheduling cycle-stealing opportunities in NOWs [5, 16], obtaining a model wherein the scheduling guidelines of [16] produce optimal schedules for every such opportunity. Although computing optimal ...
- ArticleJuly 2000
Scheduling Cilk multithreaded parallel programs on processors of different speeds
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 13–21https://doi.org/10.1145/341800.341803We study the problem of executing parallel programs, in particular Cilk programs, on a collection of processors of different speeds. We consider a model in which each processor maintains an estimate of its own speed, where communication between ...
- ArticleJuly 2000
The data locality of work stealing
SPAA '00: Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architecturesPages 1–12https://doi.org/10.1145/341800.341801This paper studies the data locality of the work-stealing scheduling algorithm on hardware-controlled shared-memory machines. We present lower and upper bounds on the number of cache misses using work stealing, and introduce a locality-guided work-...