[DL輪読会]Pose Manipulation with Identity Preservation

>100 Views

May 08, 20

スライド概要

2020/05/08
Deep Learning JP:
http://deeplearning.jp/seminar-2/

シェア

またはPlayer版

埋め込む »CMSなどでJSが使えない場合

(ダウンロード不可)

関連スライド

各ページのテキスト
1.

DEEP LEARNING JP [DL Papers] Pose Manipulation with Identity Preservation Keno Harada, B4, UT http://deeplearning.jp/ 1

2.

● ○ ○ ○ ● ● ● Andrei-Timotei Ardelean, Lucian Mircea Sasu Transilvania University of Brasov Journal : INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL : : ○ https://arxiv.org/pdf/2004.09169.pdf https://github.com/TArdelean/CainGAN landmark , Face Reenactment SPADE .

3.

● Reenactment ● ● ●

4.

Reenactment: ● , , , ○ ■ Synthesizing Obama: Learning Lip Sync from Audio Image from The Creation and Detection of Deepfakes: A Survey

5.

Reenactment: ● , ○ ○ , photometric, geometric and kinematic complexity ● ○ Image from The Uncanny Valley

6.

Reenactment: ● Warping ○ ○ ■ ■ ■ ■ ■ , occlusion Bringing portraits to life View morphing Deepwarp: Photorealistic image resynthesis for gaze manipulation Unsupervised disentangling of shape and appearance X2face: A network for controlling face generation using images, audio, and pose codes ● Direct synthesis ○ ○ ■ ■ ■ Image-to-image translation with conditional adversarial networks Deep video portraits Video-to video synthesis

7.

Reenactment: ● ● few-shot, one-shot ○ ○ ○ ○ Few-Shot Adversarial Learning of Realistic Neural Talking Head Models Pose Manipulation with Identity Preservation <One-Shot Identity-Preserving Portrait Reenactment Neural Head Reenactment with Latent Pose Descriptors (CVPR 2020)

8.

: CainGAN

9.

:4 ● Generator ○ landmark, ● Targetted Embedder ○ , landmark, (Identity) ● Identity discriminator ○ ○ Identity source ● Pose discriminator ○ ○ Target landmark landmark

10.

: Generator ● Semantic Image Synthesis with Spatially-Adaptive Normalization ○ SPADE ● High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs ○ Pix2PixHD, Multiscale discriminator ● Which Training Methods for GANs do actually Converge? ○ ResNet

11.

: Generator: SPADE ● BatchNorm ○ ○ ● Semantic GauGAN semantic BatchNorm

12.

: Discriminator: Multiscale ● Discriminator, Discriminator Discriminator

13.

: Targeted Embedder Responsibitiliy Spatial embedding Identity embedding ●

18.

● Targetted embedder ● SPADE ○ Identity Embedding Semantic ● Semantic ○ Spade ● Few shot vid2vid ● ?? ??