J4

• 研究论文 • 上一篇    下一篇

一种易于硬件实现的运动估计算法及其VLSI实现

赵波;杜建超;颜尧平   

  1. (西安电子科技大学 综合业务网理论及关键技术国家重点实验室, 陕西 西安 710071)

  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2003-04-20 发布日期:2003-04-20

A block-matching algorithm based on hardware implementation and its VLSI architecture

ZHAO Bo;DU Jian-chao;YAN Yao-ping

  

  1. (State Key Lab. of Integrated Service Networks, Xidian Univ., Xi'an 710071, China)
  • Received:1900-01-01 Revised:1900-01-01 Online:2003-04-20 Published:2003-04-20

摘要: 综合考虑硬件成本和运动估计的精度,提出了一种易于硬件实现的运动估计算法,称之为分层准全搜索法. 在串行输入以100%效率并行处理的硬件结构的基础上,进一步采用并行处理结构和流水线处理的方式,并结合了分层搜索的思想. 所使用的硬件资源是全搜索法的四分之一,而且降低了系统时钟,从而降低了成本. 实验结果表明算法得到的PSNR和全搜索匹配法可比,比其他快速搜索算法要好. 文中结合H.263图像编解码器的实现,提出了一种并行处理时的数据存储方案,大大节省了片内存储器,从而又节省了系统功率和成本,已用FPGA实现了这种算法.

关键词: 运动估计补偿, 分层准全搜索法, 并行处理, 流水线, 硬件结构

Abstract: Based on the cost of hardware implementation and the precision of estimation, a new block-matching algorithm named the Hierarchical Quasi-Full Search Algorithm(HQFSA) is proposed. HQFSA can be easily realized by hardware. A parallel architecture as well as a pipelining architecture is proposed based on one-dimentional array processor with sequential inputs and processing with 100 percent efficiency. The HQFSA combines the main idea in the full search algorithm with that in the hierarchical block matching algorithm. The hardware cost of HQFSA is a quarter that of the full search algorithm. The clock frequency is reduced as a result of the parallel architecture. The experimental results show that the PSNR of HQFSA is comparable to that of the full search algorithm and better than that of the hierarchical block matching algorithm and other fast matching algorithms. We have also proposed an data memory scheme for reducing the cost based on the recommendation H.263. We have already realized this algorithm using FPGA.

Key words: motion estimation and compensation, hierarchical quasi full search algorithm, parallel processing, pipelining architecutre, hardware architecture

中图分类号: 

  • TP919.81