• Breiter S, Trotter J and Fürlinger K. Modelling Data Locality of Sparse Matrix-Vector Multiplication on the A64FX. Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis. (1334-1342).

    https://doi.org/10.1145/3624062.3624198

  • Wang K, Liu J and Chen F. (2020). Put an elephant into a fridge. Proceedings of the VLDB Endowment. 13:9. (1540-1554). Online publication date: 1-May-2020.

    https://doi.org/10.14778/3397230.3397247

  • Sung H, Kim M, Min J and Eom H. (2018). OLM. The Journal of Supercomputing. 74:2. (637-664). Online publication date: 1-Feb-2018.

    https://doi.org/10.1007/s11227-017-2181-9

  • Hu Y, Song M and Li T. (2017). Towards "Full Containerization" in Containerized Network Function Virtualization. ACM SIGPLAN Notices. 52:4. (467-481). Online publication date: 12-May-2017.

    https://doi.org/10.1145/3093336.3037713

  • Hu Y, Song M and Li T. (2017). Towards "Full Containerization" in Containerized Network Function Virtualization. ACM SIGARCH Computer Architecture News. 45:1. (467-481). Online publication date: 11-May-2017.

    https://doi.org/10.1145/3093337.3037713

  • Hu Y, Song M and Li T. Towards "Full Containerization" in Containerized Network Function Virtualization. Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems. (467-481).

    https://doi.org/10.1145/3037697.3037713

  • Liao X, Guo R, Jin H, Yue J and Tan G. (2017). Enhancing the Malloc System with Pollution Awareness for Better Cache Performance. IEEE Transactions on Parallel and Distributed Systems. 28:3. (731-745). Online publication date: 1-Mar-2017.

    https://doi.org/10.1109/TPDS.2016.2587644

  • Lavaee R. (2016). The hardness of data packing. ACM SIGPLAN Notices. 51:1. (232-242). Online publication date: 8-Apr-2016.

    https://doi.org/10.1145/2914770.2837669

  • Lavaee R. The hardness of data packing. Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages. (232-242).

    https://doi.org/10.1145/2837614.2837669

  • Gracioli G, Alhammad A, Mancuso R, Fröhlich A and Pellizzoni R. (2015). A Survey on Cache Management Mechanisms for Real-Time Embedded Systems. ACM Computing Surveys. 48:2. (1-36). Online publication date: 21-Nov-2015.

    https://doi.org/10.1145/2830555

  • Pan A and Pai V. Runtime-driven shared last-level cache management for task-parallel programs. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. (1-12).

    https://doi.org/10.1145/2807591.2807625

  • Kuang W, Brown L and Wang Z. Modeling cross-architecture co-tenancy performance interference. Proceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing. (231-240).

    https://doi.org/10.1109/CCGrid.2015.152

  • Ye Y, West R, Cheng Z and Li Y. COLORIS. Proceedings of the 23rd international conference on Parallel architectures and compilation. (381-392).

    https://doi.org/10.1145/2628071.2628104

  • Liu X, Sharma K and Mellor-Crummey J. ArrayTool. Proceedings of the 23rd international conference on Parallel architectures and compilation. (405-416).

    https://doi.org/10.1145/2628071.2628102

  • Zhang L, Liu Y, Wang R and Qian D. (2014). Lightweight dynamic partitioning for last-level cache of multicore processor on real system. The Journal of Supercomputing. 69:2. (547-560). Online publication date: 1-Aug-2014.

    https://doi.org/10.1007/s11227-014-1092-2

  • Cui Z, Chen L, Bao Y and Chen M. A Swap-based Cache Set Index Scheme to Leverage both Superpage and Page Coloring Optimizations. Proceedings of the 51st Annual Design Automation Conference. (1-6).

    https://doi.org/10.1145/2593069.2593078

  • Huang Y, Chen L, Cui Z, Ruan Y, Bao Y, Chen M and Sun N. (2014). HMTT. ACM Transactions on Architecture and Code Optimization. 11:1. (1-25). Online publication date: 1-Feb-2014.

    https://doi.org/10.1145/2579668

  • Pan A and Pai V. Imbalanced cache partitioning for balanced data-parallel programs. Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture. (297-309).

    https://doi.org/10.1145/2540708.2540734

  • Brock J, Gu X, Bao B and Ding C. (2013). Pacman. ACM SIGPLAN Notices. 48:11. (39-50). Online publication date: 4-Dec-2013.

    https://doi.org/10.1145/2555670.2466482

  • Zheng F, Yu H, Hantas C, Wolf M, Eisenhauer G, Schwan K, Abbasi H and Klasky S. GoldRush. Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis. (1-12).

    https://doi.org/10.1145/2503210.2503279

  • Ma R, Ye W, Liang A, Guan H and Li J. (2013). Cache isolation for virtualization of mixed general-purpose and real-time systems. Journal of Systems Architecture: the EUROMICRO Journal. 59:10. (1405-1413). Online publication date: 1-Nov-2013.

    https://doi.org/10.1016/j.sysarc.2013.07.004

  • Brock J, Gu X, Bao B and Ding C. Pacman. Proceedings of the 2013 international symposium on memory management. (39-50).

    https://doi.org/10.1145/2491894.2466482

  • Brock J, Gu X, Bao B and Ding C. Pacman. Proceedings of the 2013 international symposium on memory management. (39-50).

    https://doi.org/10.1145/2464157.2466482

  • Vasilache N, Baskaran M, Meister B and Lethin R. Memory reuse optimizations in the R-Stream compiler. Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units. (42-53).

    https://doi.org/10.1145/2458523.2458528

  • Andrade D, Fraguela B and Doallo R. (2013). Accurate prediction of the behavior of multithreaded applications in shared caches. Parallel Computing. 39:1. (36-57). Online publication date: 1-Jan-2013.

    https://doi.org/10.1016/j.parco.2012.11.003

  • Huang T, Zhong Q, Guan X, Wang X, Cheng X and Wang K. Reducing last level cache pollution through OS-level software-controlled region-based partitioning. Proceedings of the 27th Annual ACM Symposium on Applied Computing. (1779-1784).

    https://doi.org/10.1145/2245276.2232065

  • Zhong Q, Guan X, Huang T, Cheng X and Wang K. Affinity-aware DMA buffer management for reducing off-chip memory access. Proceedings of the 27th Annual ACM Symposium on Applied Computing. (1588-1593).

    https://doi.org/10.1145/2245276.2232031

  • Kim J, Kim I and Eom Y. Code-based cache partitioning for improving hardware cache performance. Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication. (1-5).

    https://doi.org/10.1145/2184751.2184803

  • Zhang Y, Jiao M, Wang Z, Wang S and Zhou X. W-Order scan. Proceedings of the 12th international conference on Web-age information management. (480-492).

    /doi/10.5555/2035562.2035617

  • Ding X, Wang K and Zhang X. (2011). ULCC. ACM SIGPLAN Notices. 46:8. (103-112). Online publication date: 7-Sep-2011.

    https://doi.org/10.1145/2038037.1941568

  • Perarnau S, Tchiboukdjian M and Huard G. Controlling cache utilization of HPC applications. Proceedings of the international conference on Supercomputing. (295-304).

    https://doi.org/10.1145/1995896.1995942

  • Ding X, Wang K and Zhang X. SRM-buffer. Proceedings of the sixth conference on Computer systems. (243-256).

    https://doi.org/10.1145/1966445.1966468

  • Ding X, Wang K and Zhang X. ULCC. Proceedings of the 16th ACM symposium on Principles and practice of parallel programming. (103-112).

    https://doi.org/10.1145/1941553.1941568

  • Lee R, Ding X, Chen F, Lu Q and Zhang X. (2009). MCC-DB. Proceedings of the VLDB Endowment. 2:1. (373-384). Online publication date: 1-Aug-2009.

    https://doi.org/10.14778/1687627.1687670