Journal of Xidian University ›› 2025, Vol. 52 ›› Issue (2): 57-84.doi: 10.19665/j.issn1001-2400.20240907

Previous Articles     Next Articles

Review of deep learning-based methods for driving facial animation

LIU Long1(), LI Haosheng1(), ZHANG Mengxuan2(), DU Ying3(), CHANG Yaqi1(), ZHANG Wenbo1()   

  1. 1. School of Electronic Engineering,Xidian University,Xi’an 710071,China
    2. School of Artificial Intelligence,Xidian University,Xi’an 710071,China
    3. Beijing Film Academy,Beijing 100088,China
  • Received:2024-05-05 Online:2025-04-20 Published:2024-09-25
  • Contact: ZHANG Mengxuan E-mail:longliu@xidian.edu.cn;haoshengli@stu.xidian.edu.cn;mxzhang@xidian.edu.cn;duying@bfa.edu.cn;yqchang@stu.xidian.edu.cn;wbzhang@xidian.edu.cn

Abstract:

Facial animation technology aims to dynamically drive static facial images using source data such as audio or video to produce realistic animation effects.The development of deep learning technology has greatly promoted the progress of facial animation technology.This deep learning technology can learn and capture facial features and movement patterns,achieving realistic and personalized facial animation through an automated driving process.Currently,there are numerous research achievements in the field of facial animation based on deep learning.However,existing reviews focus mostly on specific technologies or single-modality driving sources.This paper systematically reviews the facial animation driving technology based on deep learning,summarizing the research status according to the process of audio and video driving facial animation.First,it introduces the common process of extracting facial features from input source data.Second,it deeply analyzes the key technologies of feature extraction and animate generation,and compares the advantages and disadvantages of different deep learning network architectures in each step.Finally,it summarizes the animation generation methods under different architectures and compares their similarities and differences.In addition,this paper also lists the commonly used datasets and evaluation metrics for facial animation,summarizes the existing challenges in the field,further elaborates on the development trends of future work,and makes some prospects,aiming to provide researchers with a more comprehensive perspective on the application of deep learning in the field of facial animation.

Key words: facial animation, deep learning, audio-driven and video-driven, virtual avatars, research review

CLC Number: 

  • N7