Journal of Xidian University, 2023, Vol. 50, Issue (3): 132-141. doi: 10.19665/j.issn1001-2400.2023.03.013

• Computer Science and Technology & Cyberspace Security •

New grouped piggybacking framework for distributed storage

WANG Yubo1, SUN Rong1,2, LIU Jingwei3

  1. State Key Laboratory of Integrated Services Networks, Xidian University, Xi'an 710071, China
  2. Xiamen Key Laboratory of Mobile Multimedia Communications, Huaqiao University, Xiamen 361021, China
  3. Shaanxi Key Laboratory of Blockchain and Secure Computing, Xidian University, Xi'an 710071, China
  • Received: 2022-08-16 Online: 2023-06-20 Published: 2023-10-13
  • Corresponding author: SUN Rong
  • About the authors: WANG Yubo (born 1999), male, master's student at Xidian University, E-mail: ybwang21@stu.xidian.edu.cn; LIU Jingwei (born 1978), male, associate professor, Ph.D., E-mail: jwliu@mail.xidian.edu.cn
  • Funding:
    National Key Research and Development Program of China (2021YFA000500); National Natural Science Foundation of China (62171356); Key Research and Development Program of Shaanxi Province, Key Industry Innovation Chain Project (2020ZDLGY05-04); Key Research and Development Program of Shaanxi Province, Key Industry Innovation Chain Project (2021ZDLGY05-03)

Abstract:

With the rapid development of Internet technology, the explosive growth of global data volume poses a serious challenge to the reliability and availability of distributed storage systems. As an efficient fault-tolerance technique, the piggybacking framework has become a research hotspot in recent years. Most existing piggybacking frameworks reduce the repair bandwidth at the expense of the number of substripes and the repair degree, so the data saved while repairing failed nodes usually does not translate into better disk read efficiency. To address this, a novel grouped piggybacking framework is proposed that further reduces both the number of substripes and the repair degree of information nodes while keeping the repair bandwidth low, thereby improving the input/output (I/O) performance of distributed storage systems. In the framework, the parity nodes are divided into two parts, and the information symbols and parity symbols are each grouped according to certain rules and then piggybacked in turn onto the corresponding parity nodes; the design is simple. With this design, both information nodes and parity nodes can be repaired efficiently, which reduces the number of substripes and gives the framework strong overall repair capability when the number of parity nodes is large. Compared with other piggybacking frameworks, the new grouped piggybacking framework achieves a better trade-off among repair bandwidth, repair degree, and number of substripes, and is suitable for use in practical systems.

Key words: distributed storage, piggybacking framework, number of substripes, repair degree, repair bandwidth

CLC Number:

  • TN911.22
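
The abstract describes piggybacking only in words. As a rough illustration of the general idea (and not of the grouped framework proposed in the paper, whose grouping rules are not given here), the sketch below uses a commonly shown toy case: a (4, 2) code with two substripes, where one information symbol of the first substripe is added onto a parity symbol of the second. The function names `encode` and `repair_node1`, the modulus 257, and the parity coefficients are all assumptions chosen for illustration.

```python
# Toy piggybacking sketch: 2 information nodes, 2 parity nodes, 2 substripes.
# All arithmetic is modulo a prime P (illustrative choice, not from the paper).
P = 257


def encode(a, b):
    """Encode substripes a = (a1, a2) and b = (b1, b2) into 4 nodes.

    Each node stores one symbol per substripe. The last parity node carries
    a "piggyback": a1 is added onto its second-substripe parity symbol.
    """
    a1, a2 = a
    b1, b2 = b
    return [
        (a1, b1),                                      # information node 1
        (a2, b2),                                      # information node 2
        ((a1 + a2) % P, (b1 + b2) % P),                # parity node 1
        ((a1 + 2 * a2) % P, (b1 + 2 * b2 + a1) % P),   # parity node 2 + piggyback
    ]


def repair_node1(nodes):
    """Rebuild information node 1 and report how many symbols were read."""
    b2 = nodes[1][1]            # read b2 from information node 2
    parity_b = nodes[2][1]      # read b1 + b2 from parity node 1
    piggybacked = nodes[3][1]   # read b1 + 2*b2 + a1 from parity node 2
    downloaded = 3

    b1 = (parity_b - b2) % P               # recover b1 from the plain parity
    a1 = (piggybacked - b1 - 2 * b2) % P   # strip the parity part, leaving a1
    return (a1, b1), downloaded


if __name__ == "__main__":
    stored = encode((10, 20), (30, 40))
    print(repair_node1(stored))
```

Without the piggyback, repairing node 1 by decoding each substripe separately would read k = 2 symbols per substripe, 4 in total; here 3 symbols suffice. That saving in downloaded symbols is the repair bandwidth reduction the abstract refers to, while the number of substripes (2 in this toy case) and the number of nodes contacted (the repair degree, 3 here) are the other two quantities the paper trades off.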