Li et al., 2024 - Google Patents

A Data-Centric Software-Hardware Co-Designed Architecture for Large-Scale Graph Processing

Li et al., 2024

Document ID: 14659187153354674885
Author: Li Z; Chen X; Yang Y; Min F; Zhang X; Han Y
Publication year: 2024
Publication venue: IEEE Transactions on Computers

External Links

Cited by

Snippet

Graph processing plays an important role in many practical applications. However, the inherent characteristics of graph processing, including random memory access and the low computation-to-communication ratio, make it difficult to efficiently execute on traditional …

Continue reading at ieeexplore.ieee.org (other versions)

238000012545 processing 0 title abstract description 107

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units
- G06F9/3889—Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units controlled by multiple instructions, e.g. MIMD, decoupled access or execute
- G06F9/3891—Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units controlled by multiple instructions, e.g. MIMD, decoupled access or execute organised in groups of units sharing resources, e.g. clusters
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0806—Multiuser, multiprocessor or multiprocessing cache systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30076—Arrangements for executing specific machine instructions to perform miscellaneous control operations, e.g. NOP
- G06F9/30087—Synchronisation or serialisation instructions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/3004—Arrangements for executing specific machine instructions to perform operations on memory
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G06F15/163—Interprocessor communication
- G06F15/173—Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/78—Architectures of general purpose stored programme computers comprising a single central processing unit
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/25—Using a specific main memory architecture

Similar Documents

Publication	Publication Date	Title
Shi et al.	2018	Graph processing on GPUs: A survey
Gao et al.	2015	Practical near-data processing for in-memory analytics frameworks
Ahn et al.	2015	PIM-enabled instructions: A low-overhead, locality-aware processing-in-memory architecture
Bergman et al.	2008	Exascale computing study: Technology challenges in achieving exascale systems
US9116738B2 (en)	2015-08-25	Method and apparatus for efficient execution of concurrent processes on a multithreaded message passing system
Sterling et al.	2002	Gilgamesh: A multithreaded processor-in-memory architecture for petaflops computing
Huang et al.	2019	Active-routing: Compute on the way for near-data processing
KR101830685B1 (en)	2018-02-21	On-chip mesh interconnect
Addisie et al.	2018	Heterogeneous memory subsystem for natural graph analytics
Lu et al.	2022	MT-3000: a heterogeneous multi-zone processor for HPC
Wang et al.	2023	{MGG}: Accelerating graph neural networks with {Fine-Grained}{Intra-Kernel}{Communication-Computation} pipelining on {Multi-GPU} platforms
CN103744644A (en)	2014-04-23	Quad-core processor system built in quad-core structure and data switching method thereof
TW202215227A (en)	2022-04-16	Runtime patching of configuration files
Sato et al.	2021	Co-design and system for the supercomputer “Fugaku”
Tian et al.	2023	Abndp: Co-optimizing data access and load balance in near-data processing
Mohanamuraly et al.	2020	Hardware locality-aware partitioning and dynamic load-balancing of unstructured meshes for large-scale scientific applications
Li et al.	2022	GraphRing: an HMC-ring based graph processing framework with optimized data movement
Pöppl et al.	2019	A UPC++ actor library and its evaluation on a shallow water proxy application
Mirsadeghi et al.	2016	PTRAM: A parallel topology-and routing-aware mapping framework for large-scale HPC systems
Li et al.	2024	A Data-Centric Software-Hardware Co-Designed Architecture for Large-Scale Graph Processing
Zhuo et al.	2021	Distributed graph processing system and processing-in-memory architecture with precise loop-carried dependency guarantee
Lyberis	2013	Myrmics: A scalable runtime system for global address spaces
Li et al.	2019	Dual buffer rotation four-stage pipeline for CPU–GPU cooperative computing
Addisie et al.	2020	Centaur: Hybrid processing in on/off-chip memory architecture for graph analytics
Nakazawa et al.	1999	CP-PACS: A massively parallel processor at the University of Tsukuba