Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJuly 2024
Denseflex: A Low Rank Factorization Methodology for Adaptable Dense Layers in DNNs
CF '24: Proceedings of the 21st ACM International Conference on Computing FrontiersPages 21–31https://doi.org/10.1145/3649153.3649183Low-Rank Factorization (LRF) is a popular compression technique used in Deep Neural Networks (DNNs). LRF can reduce both the memory size and the arithmetic operations in a DNN layer by approximating a weight tensor/matrix by two or more smaller tensors/...
- research-articleOctober 2023
Design and Implementation of Deep Learning 2D Convolutions on Modern CPUs
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 34, Issue 12Pages 3104–3116https://doi.org/10.1109/TPDS.2023.3322037In this article, a new method is provided for accelerating the execution of convolution layers in Deep Neural Networks. This research work provides the theoretical background to efficiently design and implement the convolution layers on x86/x64 CPUs, ...