Human Pose Transfer with Augmented Disentangled Feature Consistency

Kun Wu, Chengxiang Yin, C. H.E. Zhengping*, B. O. Jiang, Jian Tang*, Zheng Guan, Gangyi Ding

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Deep generative models have made great progress in synthesizing images with arbitrary human poses and transferring the poses of one person to others. Though many different methods have been proposed to generate images with high visual fidelity, the main challenge remains and comes from two fundamental issues: pose ambiguity and appearance inconsistency. To alleviate the current limitations and improve the quality of the synthesized images, we propose a pose transfer network with augmented Disentangled Feature Consistency (DFC-Net) to facilitate human pose transfer. Given a pair of images containing the source and target person, DFC-Net extracts pose and static information from the source and target respectively, then synthesizes an image of the target person with the desired pose from the source. Moreover, DFC-Net leverages disentangled feature consistency losses in the adversarial training to strengthen the transfer coherence and integrates a keypoint amplifier to enhance the pose feature extraction. With the help of the disentangled feature consistency losses, we further propose a novel data augmentation scheme that introduces unpaired support data with the augmented consistency constraints to improve the generality and robustness of DFC-Net. Extensive experimental results on Mixamo-Pose and EDN-10k have demonstrated DFC-Net achieves state-of-the-art performance on pose transfer.

Original languageEnglish
Article number3
JournalACM Transactions on Intelligent Systems and Technology
Volume15
Issue number1
DOIs
Publication statusPublished - 19 Dec 2023

Keywords

  • Human pose transfer
  • computer vision
  • generative adversarial network
  • image generation

Fingerprint

Dive into the research topics of 'Human Pose Transfer with Augmented Disentangled Feature Consistency'. Together they form a unique fingerprint.

Cite this