J4 ›› 2013, Vol. 40 ›› Issue (3): 160-165. doi: 10.3969/j.issn.1001-2400.2013.03.024

• Research Paper •

Improved distributed traffic sampling algorithm for high speed network

SUN Yu1; JIANG Fuwei1; XIA Jingbo1; ZHAO Xiaohuan1; SHEN Jian1; LI Minghui2

  1. School of Information and Navigation, Air Force Engineering Univ., Xi'an 710077, China
  2. Air Force Logistics Department, Beijing 100720, China
  • Received: 2012-09-27 Online: 2013-06-20 Published: 2013-07-29
  • About the author: SUN Yu (1989-), male, Ph.D. candidate at Air Force Engineering University, E-mail: 458657673@qq.com.
  • Funding:

    Science and Technology Plan Project of Shaanxi Province (2012JZ8005); All-Army Military Science Postgraduate Research Project (2010XXXX-488)

Abstract:

Mask-matching sampling is a distributed sampling measurement technique that can be used in high-speed networks. However, some sampling masks cause large measurement errors. To address this problem, the paper proposes an improved scheme that increases the randomness of the sample. The scheme first applies an XOR operation to the identification field to obtain a new identifier value, and then performs mask matching and sampling on that value. The paper proves theoretically that the XOR operation effectively improves the randomness of the result, and verifies this statistically with measured traffic data. Finally, the improved algorithm is tested in terms of effectiveness and stability. Experimental results show that every evaluation index of the improved algorithm is better than that of the algorithm in reference [10], and that any sampling mask can be used in the improved algorithm.

Key words: distributed sampling, mask matching, identification field, randomness, XOR operation
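
The abstract does not spell out how the XOR step is constructed, so the following Python sketch only illustrates the general idea of XOR-then-mask-match. It rests on three assumptions that are not taken from the paper: the identification field is treated as the 16-bit IPv4 ID, packet IDs in a measurement window are modeled as a simple counter, and the XOR step is a hypothetical `xor_fold` that XORs the low byte into the high byte. The names `xor_fold`, `is_sampled`, and `count_sampled` are illustrative only.

```python
def xor_fold(ident: int) -> int:
    """Hypothetical XOR step (an assumption, not the paper's exact construction):
    XOR the low byte of a 16-bit ID into its high byte."""
    ident &= 0xFFFF
    return ident ^ ((ident << 8) & 0xFFFF)


def is_sampled(ident: int, mask: int, pattern: int = 0, use_xor: bool = True) -> bool:
    """Mask matching: select the packet when the masked bits equal the pattern."""
    value = xor_fold(ident) if use_xor else ident & 0xFFFF
    return (value & mask) == (pattern & mask)


def count_sampled(idents, mask: int, use_xor: bool) -> int:
    """Count how many IDs in the window are selected by the mask."""
    return sum(is_sampled(i, mask, use_xor=use_xor) for i in idents)


# Assumed traffic model: IDs in the observation window come from a counter.
window = range(8192)
mask = 0xF000  # four mask bits, so the nominal sampling rate is 1/16

plain = count_sampled(window, mask, use_xor=False)
folded = count_sampled(window, mask, use_xor=True)
print(plain / len(window), folded / len(window))
```

Under this counter model, matching the raw ID against the high-bit mask selects every packet with an ID below 4096, which is half of the window, whereas matching the folded value selects 1/16 of the window, the nominal rate for a four-bit mask. Because the rule is deterministic, observation points that apply the same mask and pattern would select the same packets, which is what makes this kind of scheme usable for distributed measurement.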