Röhrig-Zöllner et al., 2015 - Google Patents
Increasing the performance of the Jacobi--Davidson method by blockingRöhrig-Zöllner et al., 2015
View PDF- Document ID
- 12856722638119789124
- Author
- Röhrig-Zöllner M
- Thies J
- Kreutzer M
- Alvermann A
- Pieper A
- Basermann A
- Hager G
- Wellein G
- Fehske H
- Publication year
- Publication venue
- SIAM Journal on Scientific Computing
External Links
Snippet
Block variants of the Jacobi--Davidson method for computing a few eigenpairs of a large sparse matrix are known to improve the robustness of the standard algorithm when it comes to computing multiple or clustered eigenvalues. In practice, however, they are typically …
- 230000000903 blocking 0 title abstract description 22
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/60—Methods or arrangements for performing computations using a digital non-denominational number representation, i.e. number representation without radix; Computing devices using combinations of denominational and non-denominational quantity representations, e.g. using difunction pulse trains, STEELE computers, phase computers
- G06F7/72—Methods or arrangements for performing computations using a digital non-denominational number representation, i.e. number representation without radix; Computing devices using combinations of denominational and non-denominational quantity representations, e.g. using difunction pulse trains, STEELE computers, phase computers using residue arithmetic
- G06F7/724—Finite field arithmetic
- G06F7/726—Inversion; Reciprocal calculation; Division of elements of a finite field
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/70—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds
- G06F19/701—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds for molecular modelling, e.g. calculation and theoretical details of quantum mechanics, molecular mechanics, molecular dynamics, Monte Carlo methods, conformational analysis or the like
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/003—Reconstruction from projections, e.g. tomography
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/002—Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Röhrig-Zöllner et al. | Increasing the performance of the Jacobi--Davidson method by blocking | |
Marek et al. | The ELPA library: scalable parallel eigenvalue solutions for electronic structure theory and computational science | |
Aktulga et al. | Optimizing sparse matrix-multiple vectors multiplication for nuclear configuration interaction calculations | |
Clark et al. | Solving Lattice QCD systems of equations using mixed precision solvers on GPUs | |
Jaiswal et al. | FPGA-based high-performance and scalable block LU decomposition architecture | |
Anzt et al. | Accelerating the LOBPCG method on GPUs using a blocked sparse matrix vector product. | |
Pieper et al. | High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations | |
Villa et al. | Power/performance trade-offs of small batched LU based solvers on GPUs | |
Choi et al. | High-performance dense tucker decomposition on GPU clusters | |
Elafrou et al. | SparseX: A library for high-performance sparse matrix-vector multiplication on multicore platforms | |
Amaricci et al. | EDIpack: A parallel exact diagonalization package for quantum impurity problems | |
Nechma et al. | Parallel sparse matrix solution for circuit simulation on FPGAs | |
Aktulga et al. | A high performance block eigensolver for nuclear configuration interaction calculations | |
Kreutzer et al. | Performance engineering of the Kernel Polynomal Method on large-scale CPU-GPU systems | |
Zapletal et al. | Parallel and vectorized implementation of analytic evaluation of boundary integral operators | |
Utsuno et al. | Efficient computation of Hamiltonian matrix elements between non-orthogonal Slater determinants | |
Moustafa et al. | Shared memory parallelism for 3D Cartesian discrete ordinates solver | |
Mudigere et al. | Exploring shared-memory optimizations for an unstructured mesh CFD application on modern parallel systems | |
Hakala et al. | Parallel electronic structure calculations using multiple graphics processing units (GPUs) | |
Anzt et al. | Energy efficiency and performance frontiers for sparse computations on GPU supercomputers | |
Gnanavignesh et al. | Parallel sparse LU factorization of power flow Jacobian using GPU | |
Isupov | Multiple-precision sparse matrix–vector multiplication on GPUs | |
Wittmann et al. | Multicore performance engineering of sparse triangular solves using a modified roofline model | |
Hirano et al. | A third-generation density-functional-theory-based method for calculating canonical molecular orbitals of large molecules | |
Kreutzer et al. | Chebyshev filter diagonalization on modern manycore processors and GPGPUs |