Röhrig-Zöllner et al., 2015 - Google Patents

Increasing the performance of the Jacobi--Davidson method by blocking

Röhrig-Zöllner et al., 2015

View PDF
Document ID
12856722638119789124
Author
Röhrig-Zöllner M
Thies J
Kreutzer M
Alvermann A
Pieper A
Basermann A
Hager G
Wellein G
Fehske H
Publication year
Publication venue
SIAM Journal on Scientific Computing

External Links

Snippet

Block variants of the Jacobi--Davidson method for computing a few eigenpairs of a large sparse matrix are known to improve the robustness of the standard algorithm when it comes to computing multiple or clustered eigenvalues. In practice, however, they are typically …
Continue reading at elib.dlr.de (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/30Arrangements for executing machine-instructions, e.g. instruction decode
    • G06F9/30003Arrangements for executing specific machine instructions
    • G06F9/30007Arrangements for executing specific machine instructions to perform operations on data operands
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/11Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50Computer-aided design
    • G06F17/5009Computer-aided design using simulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/60Methods or arrangements for performing computations using a digital non-denominational number representation, i.e. number representation without radix; Computing devices using combinations of denominational and non-denominational quantity representations, e.g. using difunction pulse trains, STEELE computers, phase computers
    • G06F7/72Methods or arrangements for performing computations using a digital non-denominational number representation, i.e. number representation without radix; Computing devices using combinations of denominational and non-denominational quantity representations, e.g. using difunction pulse trains, STEELE computers, phase computers using residue arithmetic
    • G06F7/724Finite field arithmetic
    • G06F7/726Inversion; Reciprocal calculation; Division of elements of a finite field
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored programme computers
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformations of program code
    • G06F8/41Compilation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F1/00Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00Digital computing or data processing equipment or methods, specially adapted for specific applications
    • G06F19/70Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds
    • G06F19/701Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds for molecular modelling, e.g. calculation and theoretical details of quantum mechanics, molecular mechanics, molecular dynamics, Monte Carlo methods, conformational analysis or the like
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/003Reconstruction from projections, e.g. tomography
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/002Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation

Similar Documents

Publication Publication Date Title
Röhrig-Zöllner et al. Increasing the performance of the Jacobi--Davidson method by blocking
Marek et al. The ELPA library: scalable parallel eigenvalue solutions for electronic structure theory and computational science
Aktulga et al. Optimizing sparse matrix-multiple vectors multiplication for nuclear configuration interaction calculations
Clark et al. Solving Lattice QCD systems of equations using mixed precision solvers on GPUs
Jaiswal et al. FPGA-based high-performance and scalable block LU decomposition architecture
Anzt et al. Accelerating the LOBPCG method on GPUs using a blocked sparse matrix vector product.
Pieper et al. High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations
Villa et al. Power/performance trade-offs of small batched LU based solvers on GPUs
Choi et al. High-performance dense tucker decomposition on GPU clusters
Elafrou et al. SparseX: A library for high-performance sparse matrix-vector multiplication on multicore platforms
Amaricci et al. EDIpack: A parallel exact diagonalization package for quantum impurity problems
Nechma et al. Parallel sparse matrix solution for circuit simulation on FPGAs
Aktulga et al. A high performance block eigensolver for nuclear configuration interaction calculations
Kreutzer et al. Performance engineering of the Kernel Polynomal Method on large-scale CPU-GPU systems
Zapletal et al. Parallel and vectorized implementation of analytic evaluation of boundary integral operators
Utsuno et al. Efficient computation of Hamiltonian matrix elements between non-orthogonal Slater determinants
Moustafa et al. Shared memory parallelism for 3D Cartesian discrete ordinates solver
Mudigere et al. Exploring shared-memory optimizations for an unstructured mesh CFD application on modern parallel systems
Hakala et al. Parallel electronic structure calculations using multiple graphics processing units (GPUs)
Anzt et al. Energy efficiency and performance frontiers for sparse computations on GPU supercomputers
Gnanavignesh et al. Parallel sparse LU factorization of power flow Jacobian using GPU
Isupov Multiple-precision sparse matrix–vector multiplication on GPUs
Wittmann et al. Multicore performance engineering of sparse triangular solves using a modified roofline model
Hirano et al. A third-generation density-functional-theory-based method for calculating canonical molecular orbitals of large molecules
Kreutzer et al. Chebyshev filter diagonalization on modern manycore processors and GPGPUs