skip to main content
10.1145/2716282.2716287acmotherconferencesArticle/Chapter ViewAbstractPublication PagesgpgpuConference Proceedingsconference-collections
research-article

Rethinking the parallelization of random-restart hill climbing: a case study in optimizing a 2-opt TSP solver for GPU execution

Published: 07 February 2015 Publication History

Abstract

Random-restart hill climbing is a common approach to combinatorial optimization problems such as the traveling salesman problem (TSP). We present and evaluate an implementation of random-restart hill climbing with 2-opt local search applied to TSP. Our implementation is capable of addressing large problem sizes at high throughput. It is based on the key insight that the GPU’s hierarchical hardware parallelism is best exploited with a hierarchical implementation strategy, where independent climbs are parallelized between blocks and the 2-opt evaluations are parallelized across the threads within a block. We analyze the performance impact of this and other optimizations on our heuristic TSP solver and compare its performance to existing GPU-based 2-opt TSP solvers as well as a parallel CPU implementation. Our code outperforms the existing implementations by up to 3X, evaluating up to 60 billion 2-opt moves per second on a single K40 GPU. It also outperforms an OpenMP implementation run on 20 CPU cores by up to 8X.

References

[1]
Agarwala, R., Applegate, D.L., Maglott, D., Schuler, G.D., Schaffer, A.A. 2000. A Fast and Scalable Radiation Hybrid Map Construction and Integration Strategy. Genome Res. 10, 3 (Mar. 2000), 350-364.
[2]
Bai, H., OuYang, D., Li, X., He, L., and Yu, H. 2009. MAX-MIN Ant System on GPU with CUDA. Proceedings of the Fourth International Conference on Innovative Computing, Information and Control (Dec. 2009), 801-804.
[3]
Blelloch, G. E. 1996. Programming Parallel Algorithms. Communications of the ACM. 39, 3 (Mar. 1996), 85–97.
[4]
Cecilia, J.M., García, J.M., Nisbet, A., Amos, M., and Ujaldón, M. 2013. Enhancing Data Parallelism for Ant Colony Optimization on GPUs. J. Parallel Distrib. Comput. 73, 1 (Jan. 2013), 42-51.
[5]
Chen, S., Davis, S., Jiang, H., and Novobilski, A. 2011. CUDABased Genetic Algorithm on Traveling Salesman Problem. Computer and Information Science 2011, R. Lee, Ed. Springer Berlin, Heidelberg, 241-252.
[6]
Croes, G.A. 1958. A Method for Solving Traveling-Salesman Problems. Oper. Res. 6, 791-812.
[7]
Delévacq, A., Delisle, P., and Krajecki, M. 2012. Parallel GPU Implementation of Iterated Local Search for the Travelling Salesman Problem. Proceedings of the Sixth International Conference on Learning and Intelligent Optimization (Jan. 2012), 372-377.
[8]
Delévacq, A., Delisle, P., Gravel, M., and Krajecki, M. 2013. Parallel Ant Colony Optimization on Graphics Processing Units. J. Parallel Distrib. Comput. 73, 1 (Jan. 2013), 52-61.
[9]
Dorigo, M. 1992. Optimization, Learning and Natural Algorithms, Ph.D. thesis, Politecnico di Milano, Italy, 1992.
[10]
Dorigo, M. and Gambardella, L.M. 1997. Ant Colony System: A Cooperative Learning Approach to the Traveling Salesman Problem. IEEE Transactions on Evolutionary Computation 1, 1 (Apr. 1997), 53-66.
[11]
Exnar, F., Machač, O. 2011. The Travelling Salesman Problem and its Application in Logistic Practice. WSEAS Transactions on Business and Economics 8, 4 (Oct. 2011), 163–173.
[12]
Felsenstein, J. 1995. PHYLIP (Phylogeny Inference Package). Distributed by the author, Department of Genetics, University of Washington, http://evolution.genetics.washington.edu/phylip.html.
[13]
Fujimoto, N. and Tsutsui, S. 2010. A Highly-Parallel TSP Solver for a GPU Computing Platform. Proceedings of the Seventh International Conference on Numerical Methods and Applications (Aug. 2010), 264-271.
[14]
Garey, M.R. and Johnson, D.S. 1979. Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman.
[15]
Johnson, D. and McGeoch, L. 1997. The Traveling Salesman Problem: A Case Study in Local Optimization. Local Search in Combinatorial Optimization, E. Aarts and J. Lenstra, Eds. John Wiley and Sons, 215-310. 0 0.5 1 1.5 2 2.5 3 Spe edu p   of   tu ned ov er  ca lc cod e   (cl im be rs/se c) Number  of  cities Figure 8. Speedup in climbers/second throughput of tuned over calc, measured using the same climber count and initial tour
[16]
Lenstra, J.K. and Rinnooy Kan, A.H.G. 1975. Some Simple Applications of the Travelling Salesman Problem. Oper. Res. 26, 4 (Nov. 1975), 717-733.
[17]
Marti, R. 2003. Multi-Start Methods. Handbook of Metaheuristics, F. Glover and G.A. Kochenberger, Eds. Springer US, 355-368.
[18]
Menger, K. 1932. Das botenproblem. Ergebnisse eines Mathematischen Kolloquiums 2. Teubner, Leipzig, 11-23.
[19]
O’Neil, M.A., Tamir, D., and Burtscher, M. 2011. A Parallel GPU Version of the Traveling Salesman Problem. Proceedings of the 2011 International Conference on Parallel and Distributed Processing Techniques and Applications (Jul. 2011), 348-353.
[20]
Pingali, K., Nguyen, D., Kulkarni, M., Burtscher, M., Hassaan, M.A., Kaleem, R., Lee, T.-H., Lenharth, A., Manevich, R., Méndez-Lojo, M., Prountzos, D., and Sui, X. 2011. The Tao of Parallelism in Algorithms. Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation (Jun. 2011), 12-25.
[21]
Rabenseifner, R., Hager, G., and Jost, G. 2009. Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes. Proceedings of the 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing (Feb. 2009), 427-436.
[22]
Rego, C. and Glover, F. 2002. Local Search and Metaheuristics. Traveling Salesman Problem and its Variations, G. Gutin and A.P. Punnen, Eds. Kluwer Academic Publishers, Dordrecht, 309-368.
[23]
Rocki, K. and Suda, R. 2012. Accelerating 2-opt and 3-opt Local Search Using GPU in the Travelling Salesman Problem. Proceedings of the 2012 International Conference on High Performance Computing and Simulation (Jul. 2012), 489-495.
[24]
Rocki, K. and Suda, R. 2012. An Efficient GPU Implementation of a Multi-Start TSP Solver for Large Problem Instances. Proceedings of the Fourteenth Annual Conference Companion on Genetic and Evolutionary Computation (Jul. 2012), 1441-1442.
[25]
Rocki, K. and Suda, R. 2013. High Performance GPU Accelerated Local Optimization in TSP. Proceedings of the IEEE Seventh International Parallel and Distributed Processing Symposium Workshops & PhD Forum (May 2013), 1788-1796.
[26]
TSPLIB, http://www.iwr.uni-heidelberg.de/groups/comopt/software/TSPLIB95/.
[27]
Uchida, A., Ito, Y., and Nakano, K. 2012. An Efficient GPU Implementation of Ant Colony Optimization for the Traveling Salesman Problem. In Proceedings of the Third International Conference on Networking and Computing (Dec. 2012), 94-102.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
GPGPU-8: Proceedings of the 8th Workshop on General Purpose Processing using GPUs
February 2015
120 pages
ISBN:9781450334075
DOI:10.1145/2716282
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 February 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. CUDA
  2. GPGPU
  3. TSP
  4. code optimization
  5. hill climbing
  6. iterative local search
  7. program parallelization

Qualifiers

  • Research-article

Funding Sources

Conference

GPGPU-8

Acceptance Rates

Overall Acceptance Rate 57 of 129 submissions, 44%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)1
Reflects downloads up to 01 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media