Human Pose Transfer with Augmented Disentangled Feature Consistency

Kun Wu; Chengxiang Yin; C. H.E. Zhengping; B. O. Jiang; Jian Tang; Zheng Guan; Gangyi Ding

doi:10.1145/3626241

Human Pose Transfer with Augmented Disentangled Feature Consistency

Kun Wu, Chengxiang Yin, C. H.E. Zhengping^*, B. O. Jiang, Jian Tang^*, Zheng Guan, Gangyi Ding

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Deep generative models have made great progress in synthesizing images with arbitrary human poses and transferring the poses of one person to others. Though many different methods have been proposed to generate images with high visual fidelity, the main challenge remains and comes from two fundamental issues: pose ambiguity and appearance inconsistency. To alleviate the current limitations and improve the quality of the synthesized images, we propose a pose transfer network with augmented Disentangled Feature Consistency (DFC-Net) to facilitate human pose transfer. Given a pair of images containing the source and target person, DFC-Net extracts pose and static information from the source and target respectively, then synthesizes an image of the target person with the desired pose from the source. Moreover, DFC-Net leverages disentangled feature consistency losses in the adversarial training to strengthen the transfer coherence and integrates a keypoint amplifier to enhance the pose feature extraction. With the help of the disentangled feature consistency losses, we further propose a novel data augmentation scheme that introduces unpaired support data with the augmented consistency constraints to improve the generality and robustness of DFC-Net. Extensive experimental results on Mixamo-Pose and EDN-10k have demonstrated DFC-Net achieves state-of-the-art performance on pose transfer.

Original language	English
Article number	3
Journal	ACM Transactions on Intelligent Systems and Technology
Volume	15
Issue number	1
DOIs	https://doi.org/10.1145/3626241
Publication status	Published - 19 Dec 2023

Keywords

Human pose transfer
computer vision
generative adversarial network
image generation

Access to Document

10.1145/3626241

Cite this

@article{6bea0a5a36ef4aaa8ae63de132ea3005,

title = "Human Pose Transfer with Augmented Disentangled Feature Consistency",

abstract = "Deep generative models have made great progress in synthesizing images with arbitrary human poses and transferring the poses of one person to others. Though many different methods have been proposed to generate images with high visual fidelity, the main challenge remains and comes from two fundamental issues: pose ambiguity and appearance inconsistency. To alleviate the current limitations and improve the quality of the synthesized images, we propose a pose transfer network with augmented Disentangled Feature Consistency (DFC-Net) to facilitate human pose transfer. Given a pair of images containing the source and target person, DFC-Net extracts pose and static information from the source and target respectively, then synthesizes an image of the target person with the desired pose from the source. Moreover, DFC-Net leverages disentangled feature consistency losses in the adversarial training to strengthen the transfer coherence and integrates a keypoint amplifier to enhance the pose feature extraction. With the help of the disentangled feature consistency losses, we further propose a novel data augmentation scheme that introduces unpaired support data with the augmented consistency constraints to improve the generality and robustness of DFC-Net. Extensive experimental results on Mixamo-Pose and EDN-10k have demonstrated DFC-Net achieves state-of-the-art performance on pose transfer.",

keywords = "Human pose transfer, computer vision, generative adversarial network, image generation",

author = "Kun Wu and Chengxiang Yin and Zhengping, {C. H.E.} and Jiang, {B. O.} and Jian Tang and Zheng Guan and Gangyi Ding",

note = "Publisher Copyright: {\textcopyright} 2023 Copyright held by the owner/author(s). Publication rights licensed to ACM.",

year = "2023",

month = dec,

day = "19",

doi = "10.1145/3626241",

language = "English",

volume = "15",

journal = "ACM Transactions on Intelligent Systems and Technology",

issn = "2157-6904",

publisher = "Association for Computing Machinery (ACM)",

number = "1",

}

TY - JOUR

T1 - Human Pose Transfer with Augmented Disentangled Feature Consistency

AU - Wu, Kun

AU - Yin, Chengxiang

AU - Zhengping, C. H.E.

AU - Jiang, B. O.

AU - Tang, Jian

AU - Guan, Zheng

AU - Ding, Gangyi

PY - 2023/12/19

Y1 - 2023/12/19

N2 - Deep generative models have made great progress in synthesizing images with arbitrary human poses and transferring the poses of one person to others. Though many different methods have been proposed to generate images with high visual fidelity, the main challenge remains and comes from two fundamental issues: pose ambiguity and appearance inconsistency. To alleviate the current limitations and improve the quality of the synthesized images, we propose a pose transfer network with augmented Disentangled Feature Consistency (DFC-Net) to facilitate human pose transfer. Given a pair of images containing the source and target person, DFC-Net extracts pose and static information from the source and target respectively, then synthesizes an image of the target person with the desired pose from the source. Moreover, DFC-Net leverages disentangled feature consistency losses in the adversarial training to strengthen the transfer coherence and integrates a keypoint amplifier to enhance the pose feature extraction. With the help of the disentangled feature consistency losses, we further propose a novel data augmentation scheme that introduces unpaired support data with the augmented consistency constraints to improve the generality and robustness of DFC-Net. Extensive experimental results on Mixamo-Pose and EDN-10k have demonstrated DFC-Net achieves state-of-the-art performance on pose transfer.

AB - Deep generative models have made great progress in synthesizing images with arbitrary human poses and transferring the poses of one person to others. Though many different methods have been proposed to generate images with high visual fidelity, the main challenge remains and comes from two fundamental issues: pose ambiguity and appearance inconsistency. To alleviate the current limitations and improve the quality of the synthesized images, we propose a pose transfer network with augmented Disentangled Feature Consistency (DFC-Net) to facilitate human pose transfer. Given a pair of images containing the source and target person, DFC-Net extracts pose and static information from the source and target respectively, then synthesizes an image of the target person with the desired pose from the source. Moreover, DFC-Net leverages disentangled feature consistency losses in the adversarial training to strengthen the transfer coherence and integrates a keypoint amplifier to enhance the pose feature extraction. With the help of the disentangled feature consistency losses, we further propose a novel data augmentation scheme that introduces unpaired support data with the augmented consistency constraints to improve the generality and robustness of DFC-Net. Extensive experimental results on Mixamo-Pose and EDN-10k have demonstrated DFC-Net achieves state-of-the-art performance on pose transfer.

KW - Human pose transfer

KW - computer vision

KW - generative adversarial network

KW - image generation

UR - http://www.scopus.com/inward/record.url?scp=85183326749&partnerID=8YFLogxK

U2 - 10.1145/3626241

DO - 10.1145/3626241

M3 - Article

AN - SCOPUS:85183326749

SN - 2157-6904

VL - 15

JO - ACM Transactions on Intelligent Systems and Technology

JF - ACM Transactions on Intelligent Systems and Technology

IS - 1

M1 - 3

ER -

Human Pose Transfer with Augmented Disentangled Feature Consistency

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this