Journal of Xidian University ›› 2021, Vol. 48 ›› Issue (4): 113-119. doi: 10.19665/j.issn1001-2400.2021.04.015

• Computer Science and Technology & Cyberspace Security •

General reliability model of the non-MDS coded storage system

NIE Shiqiang, ZHENG Xuda, LIU Zhaohua, WU Weiguo, DONG Xiaoshe, ZHANG Xingjun

  1. School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China
  • Received: 2020-03-02 Online: 2021-08-30 Published: 2021-08-31
  • Contact: WU Weiguo
  • About the authors: NIE Shiqiang (b. 1993), male, PhD candidate at Xi'an Jiaotong University, E-mail: nsqiang@gmail.com | ZHENG Xuda (b. 1997), male, master's student at Xi'an Jiaotong University, E-mail: zhengxd19@stu.xjtu.edu.cn | LIU Zhaohua (b. 1995), male, master's student at Xi'an Jiaotong University, E-mail: 584799633@qq.com | DONG Xiaoshe (b. 1963), male, professor, E-mail: xsdong@xjtu.edu.cn | ZHANG Xingjun (b. 1969), male, professor, E-mail: xjzhang@xjtu.edu.cn
  • Supported by:
    National Key Research and Development Program of China (2016YFB1000300); National Natural Science Foundation of China (61972311); Natural Science Foundation of Shandong Province (ZR2019LZH007)

Abstract:

How different configurations affect the reliability of a distributed storage system that uses non-maximum distance separable (non-MDS) codes as its redundancy strategy remains an open problem. To quantify this reliability, this paper starts from the construction matrix of a non-MDS code and proposes an algorithm that computes the probability that a data object encoded with the code can still be repaired after a given number of its blocks are lost. The algorithm enumerates every possible combination of lost blocks and checks, in the generator matrix, whether the matrix corresponding to each combination is invertible, in order to compute the probability of recovery. Building on Markov theory, we then establish a fairly general analytical model for measuring the reliability of such storage systems. The model quantifies the impact of a range of design factors on storage system reliability, such as the fault-tolerance configuration of the non-MDS code, the capacity of the storage system, the capacity of the object-based storage device nodes, the repair bandwidth, and the mean time to data loss of the object-based storage device nodes. Finally, numerical analysis, with locally repairable codes as the example, is used to verify the correctness of the model and to compare the influence of the different factors on storage system reliability. The model provides a theoretical basis for designing and implementing storage systems that use non-MDS codes, and it lets practitioners choose a configuration that matches their reliability requirements.
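The enumeration step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The toy generator matrix `G` (four data blocks, two local parities, one global parity), the prime field size `P = 257`, and the function names are all hypothetical choices made for illustration. The abstract's invertibility test is interpreted here as a rank check: a loss pattern counts as recoverable when the surviving rows of the generator matrix still have rank k. All loss patterns of a given size are assumed equally likely.

```python
from fractions import Fraction
from itertools import combinations

P = 257  # hypothetical prime field size, chosen only for this sketch


def rank_mod_p(rows, p=P):
    """Rank of a matrix (list of row lists) over GF(p), by Gaussian elimination."""
    m = [[x % p for x in row] for row in rows]
    ncols = len(m[0]) if m else 0
    rank = 0
    for col in range(ncols):
        # Find a row at or below the current rank with a nonzero entry in this column.
        pivot = next((r for r in range(rank, len(m)) if m[r][col]), None)
        if pivot is None:
            continue
        m[rank], m[pivot] = m[pivot], m[rank]
        inv = pow(m[rank][col], -1, p)  # modular inverse, Python 3.8+
        m[rank] = [(x * inv) % p for x in m[rank]]
        for r in range(len(m)):
            if r != rank and m[r][col]:
                factor = m[r][col]
                m[r] = [(a - factor * b) % p for a, b in zip(m[r], m[rank])]
        rank += 1
    return rank


def recoverable_probability(gen, lost, p=P):
    """Fraction of `lost`-block loss patterns from which all k data blocks can be rebuilt."""
    n, k = len(gen), len(gen[0])
    total = ok = 0
    for pattern in combinations(range(n), lost):
        gone = set(pattern)
        surviving = [gen[i] for i in range(n) if i not in gone]
        total += 1
        if rank_mod_p(surviving, p) == k:
            ok += 1
    return Fraction(ok, total)


# Hypothetical LRC-like generator matrix: one row per stored block, k = 4 columns.
G = [
    [1, 0, 0, 0],  # data block d0
    [0, 1, 0, 0],  # data block d1
    [0, 0, 1, 0],  # data block d2
    [0, 0, 0, 1],  # data block d3
    [1, 1, 0, 0],  # local parity over d0, d1
    [0, 0, 1, 1],  # local parity over d2, d3
    [1, 2, 3, 4],  # global parity
]

for lost in range(1, 5):
    print(lost, recoverable_probability(G, lost))
```

For this toy matrix every one-block and two-block loss is recoverable, but only 27 of the 35 three-block patterns are, even though the code stores three parity blocks. An MDS code with the same parameters would recover from all of them. Per-loss-count probabilities like these are the kind of quantity the abstract's algorithm produces for use in a reliability model.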

Key words: storage system, reliability, non-maximum distance separable code, Markov model, mean time to data loss

CLC Number:

  • TP302.1