2014, Vol. 27, Issue (6): 13-.

• Article

Research on Regular Chinese Fragments Reassembly Based on 0-1 Programming Model

SHEN Hongping, ZHANG Yipeng, WANG Yikang

  1. (College of Science, China Jiliang University, Hangzhou 310018, China)
  • Online: 2014-06-15 Published: 2014-06-14
  • Corresponding author: WANG Yikang (born 1976), male, associate professor. Research interests: modeling and optimization of complex industrial processes. E-mail: zjuwkang@163.com

Abstract:

By analyzing the character distribution features of regular Chinese text fragments, boundary information is extracted from each fragment and a matching degree between adjacent fragments is defined based on Euclidean distance. With the objective of maximizing the total matching degree over all fragments, a fragment reassembly model based on 0-1 programming is established, and a greedy algorithm is used to simulate the reassembly of a shredded document. The results show that the 0-1 programming model gives an accurate mathematical description of the reassembly problem for regular Chinese fragments, and that a greedy algorithm combined with appropriate manual intervention can solve the reassembly problem well.

Key words: 0-1 programming model; Chinese fragment reassembly; matching degree; greedy algorithm

CLC Number:

  • TP391.1
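
The pipeline summarized in the abstract (boundary extraction, a Euclidean-distance matching degree, greedy ordering) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: fragments are modeled as equal-height vertical strips stored as lists of pixel rows, the matching degree is taken to be `1 / (1 + distance)`, and the names `edge_distance`, `matching_degree`, `greedy_order`, and `cut_strips` are hypothetical. The sketch does not solve the 0-1 program exactly and omits the manual intervention step the abstract mentions.

```python
import math


def edge_distance(left_frag, right_frag):
    """Euclidean distance between the right edge column of left_frag
    and the left edge column of right_frag (row-by-row pixel values)."""
    return math.sqrt(sum((row_l[-1] - row_r[0]) ** 2
                         for row_l, row_r in zip(left_frag, right_frag)))


def matching_degree(left_frag, right_frag):
    """Assumed form of the matching degree: a smaller edge distance
    gives a larger value in (0, 1]."""
    return 1.0 / (1.0 + edge_distance(left_frag, right_frag))


def greedy_order(fragments, start):
    """Greedy left-to-right ordering: starting from the fragment at index
    `start`, repeatedly append the unused fragment whose left edge best
    matches the right edge of the last placed fragment."""
    order = [start]
    remaining = set(range(len(fragments))) - {start}
    while remaining:
        last = fragments[order[-1]]
        # sorted() makes tie-breaking deterministic (lowest index wins).
        best = max(sorted(remaining),
                   key=lambda j: matching_degree(last, fragments[j]))
        order.append(best)
        remaining.remove(best)
    return order


def cut_strips(image, width):
    """Cut an image (list of rows) into vertical strips of the given width."""
    n_cols = len(image[0])
    return [[row[c:c + width] for row in image]
            for c in range(0, n_cols, width)]


# Toy image whose pixel values change smoothly from column to column,
# so truly adjacent strips have the closest edges.
image = [[10 * c + r for c in range(8)] for r in range(4)]
strips = cut_strips(image, 2)                              # strips 0..3, left to right
fragments = [strips[2], strips[0], strips[3], strips[1]]   # shuffled input
recovered = greedy_order(fragments, start=1)               # index 1 holds the leftmost strip
print(recovered)  # [1, 3, 0, 2]
```

In this toy case the leftmost strip is supplied by hand through `start`. On a real document it would have to be identified some other way, for example from a blank left margin, and an operator would correct any step where the greedy choice is wrong.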