西安电子科技大学学报 ›› 2024, Vol. 51 ›› Issue (5): 71-81.doi: 10.19665/j.issn1001-2400.20240101

• 信息与通信工程 • 上一篇    下一篇

组相联可自适应扩展的缓存架构及其性能分析

周昱(), 于宗光(), 高杨(), 邵健(), 罗庆()   

  1. 中国电子科技集团公司第五十八研究所,江苏 无锡 214072
  • 收稿日期:2023-09-28 出版日期:2024-08-07 发布日期:2024-08-07
  • 作者简介:周 昱(1983—),男,高级工程师,E-mail:zhouyu@suntai.cn
    于宗光(1964—),男,研究员,E-mail:yuzg58@163.com
    高 杨(1989—),男,高级工程师,E-mail:gaoyang_1989@tom.com
    邵 健(1989—),男,高级工程师,E-mail:shaojian1008@126.com
    罗 庆(1994—),男,工程师,E-mail:1036727743@qq.com
  • 基金资助:
    国家自然科学基金(62204233);江苏省自然科学基金(BK20211040);江苏省自然科学基金(BK20211041)

Set associativity adaptively extended cache architecture and performance analysis

ZHOU Yu(), YU Zongguang(), GAO Yang(), SHAO Jian(), LUO Qing()   

  1. No.58 Research Institute,China Electronic Technology Group Corporation,Wuxi 214072,China
  • Received:2023-09-28 Online:2024-08-07 Published:2024-08-07

摘要:

在现代处理器体系架构中,缓存是解决存储墙瓶颈的重要手段,但是缓存访问需求是随程序甚至是程序片段的切换而变化的,这导致传统的固定参数配置的缓存架构难以在长时间或在程序间依然保持高效性能。文中提出一种缓存组相联度的自适应扩展方法,能根据程序运行时缓存组活跃状态,利用短时非活跃缓存组的存储空间,来扩展当前活跃缓存组的组相联数目,并可实时动态调整组与组之间的扩展互联关系,有效提升缓存空间的整体利用效率。文中在Gem5软件中对所提出的缓存组相联自适应扩展架构进行了仿真,并基于SPEC CPU 2017 基准测试集进行了性能测试,结果显示所提方法明显改善了缓存组访问的均匀性,对典型程序缓存组使用频次的均匀性最大提升23.14%左右,降低缓存访问缺失数最大可达54.2%。硬件实现和仿真结果显示,与HY-Way等低功耗可重构缓存架构相比,文中所述缓存架构资源消耗减少了7.66%以上,在嵌入式处理器设计中有较大的应用价值。

关键词: 组相联存储, 缓存架构, 组扩展, 组利用率, 性能仿真

Abstract:

In modern processor architectures,caching is an important method to solve the bottleneck of the memory wall.However,the requirement of cache access changes with the switching of the program or even a program fragment,which makes it difficult for traditional fixed-parameter configuration cache architectures to maintain a stable and efficient performance for a long time or between programs.This paper proposes an adaptively extended method for cache set associativity,which can use a short-term inactive cache set to expand the number of set associations of the current active cache set while the program is running,dynamically adjust the extended interconnection relationship between cache sets in real-time,and effectively improve the overall utilization efficiency of cache storage space.In this paper,the proposed cache architecture is simulated by Gem5 software,and the performance test is carried out based on the SPEC CPU 2017 benchmark.Simulation results show that the proposed method significantly improves the uniformity of cache set access with a maximum rate of about 23.14% for a typical program,and improves the reduction in missing ratio to a maximum of 54.2%.Hardware implementation and simulation results show that compared to low-power reconfigurable cache architectures such as HY-Way architecture,the proposed cache architecture reduces resource consumption by more than 7.66%,which has a significant application value in embedded processor designs.

Key words: associative storage, cache architecture, way extended, set utilization, performance simulation

中图分类号: 

  • TP3