| [1] |
戴洋毅, 何康, 瑚琦, 等. CNN-Transformer混合模型在计算机视觉领域的研究综述[J]. 建模与仿真, 2023, 12(4):3657-3672.
|
|
DAI Yangyi, HE Kang, HU Qi, et al. A Review of CNN-Transformer Hybrid Models in the Field of Computer Vision[J]. Modeling and Simulation, 2023, 12(4):3657-3672.
|
| [2] |
ZOU Z, CHEN K, SHI Z, et al. Object Detection in 20 Years:A Survey[J]. Proceedings of the IEEE, 2023, 111(3):257-276.
|
| [3] |
SIMONYAN K, ZISSERMAN A. Very Deep Convolutional Networks for Large-Scale Image Recognition (2014)[J/OL]. [2015-04-10]. https://arxiv.org/abs/1409.1556.
|
| [4] |
HE K, ZHANG X, REN S, et al. Deep Residual Learning for Image Recognition[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2016:770-778.
|
| [5] |
SZEGEDY C, LIU W, JIA Y, et al. Going Deeper with Convolutions[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2015:1-9.
|
| [6] |
DANTAS P V, SABINO DA SILVA JR W, CORDEIRO L C, et al. A Comprehensive Review of Model Compression Techniques in Machine Learning[J]. Applied Intelligence 2024, 54(22):11804-11844.
|
| [7] |
CHENG H, ZHANG M, SHI Q. A Survey on Deep Neural Network Pruning:Taxonomy,Comparison,Analysis,and Recommendations[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46(12):10558-10578.
|
| [8] |
LECUN Y, DENKER J, SOLLA S. Optimal Brain Damage[J]. Advances in Neural Information Processing Systems, 1989,2:598-605.
|
| [9] |
HAN S, POOL J, TRAN J, et al. Learning both Weights and Connections for Efficient Neural Network[J]. Advances in Neural Information Processing Systems, 2015,28:1135-1143.
|
| [10] |
LI H, KADAV A, DURDANOVIC I, et al. Pruning Filters for Efficient Convnets (2016)[J/OL]. [2017-03-10]. https://arxiv.org/abs/1608.08710v2.
|
| [11] |
WEI L, MA Z, YANG C, et al. Advances in the Neural Network Quantization:A Comprehensive Review[J]. Applied Sciences, 2024, 14(17):7445.
|
| [12] |
DETTMERS T. 8-Bit Approximations for Parallelism in Deep Learning (2015)[J/OL]. [2016-02-19]. https://arxiv.org/abs/1511.04561.
|
| [13] |
COURBARIAUX M, BENGIO Y, DAVID J P. Binary Connect:Training Deep Neural Networks with Binary Weights During Propagations[J]. Advances in Neural Information Processing Systems, 2015,28:3123-3131.
|
| [14] |
RASTEGARI M, ORDONEZ V, REDMON J, et al. Xnor-net:Imagenet Classification Using Binary Convolutional Neural Networks[C]// European Conference on Computer Vision. Berlin:Springer,2016:525-542.
|
| [15] |
HAO Z, GUO J, HAN K, et al. One-for-All:Bridge the Gap Between Heterogeneous Architectures in Knowledge Distillation[J]. Advances in Neural Information Processing Systems, 2024,36:1-13.
|
| [16] |
HINTON G, VINYALS O, DEAN J. Distilling the Knowledge in a Neural Network (2015)[J/OL]. [2015-03-09]. http://www.arxiv.org/abs/1503.02531.
|
| [17] |
CHEN G, CHOI W, YU X, et al. Learning Efficient Object Detection Models with Knowledge Distillation[J]. Advances in Neural Information Processing Systems, 2017,30:742-751.
|
| [18] |
POLINO A, PASCANU R, ALISTARH D. Model Compression via Distillation and Quantization (2018)[J/OL]. [2018-02-15]. https://arxiv.org/abs/1802.05668.
|
| [19] |
杨国威, 许志旺, 房臣, 等. 融合剪枝与量化的目标检测网络压缩方法[J]. 计算机工程与应用, 2022, 58(22):108-115.
doi: 10.3778/j.issn.1002-8331.2105-0134
|
|
YANG Guowei, XU Zhiwang, FANG Chen, et al. Object Detection Network Compression Method Based on Pruning and Quantization[J]. Journal of Computer Engineering & Applications, 2022, 58(22):108-115.
|
| [20] |
KIM J. Quantization Robust Pruning with Knowledge Distillation[J]. IEEE Access, 2023,11:26419-26426.
|
| [21] |
CHIU C Y, HONG D Y, LIU P, et al. Effective Compression of Language Models by Combining Pruning and Knowledge Distillation[C]// 2024 IEEE 48th Annual Computers,Software,and Applications Conference (COMPSAC). Piscataway:IEEE,2024:429-438.
|
| [22] |
HE Y, XIAO L. Structured Pruning for Deep Convolutional Neural Networks:A Survey[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 46(5):2900-2919.
|
| [23] |
HE Y, ZHANG X, SUN J. Channel Pruning for Accelerating Very Deep Neural Networks[C]// Proceedings of the IEEE International Conference on Computer Vision. Piscataway:IEEE,2017:1389-1397.
|
| [24] |
NAGEL M, FOURNARAKIS M, AMJAD R A, et al. A White Paper on Neural Network Quantization (2021)[J/OL]. [2021-06-15]. https://arxiv.org/abs/2106.08295?context=cs.AI.
|