Preprocess
K+1 frames
- K frames : used to train the Embedder, input shape [B (batch) * K, 2, C, W, H]
- x : frame, shape [B * K, C, W, H]
- y : landmark image, shape [B * K, C, W, H]
- 1 frame (frame t) : used to train the Generator
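
The K+1 split above can be sketched roughly as follows. This is only an illustration, not the original preprocessing code: `split_frames` is a hypothetical helper, and choosing frame t at random is an assumption the notes do not state.

```python
import random


def split_frames(pairs, rng=None):
    """Split K+1 (frame, landmark) pairs into K Embedder pairs and 1 target pair.

    `pairs` is a list of (x, y) tuples taken from a single video.
    Picking the target index t at random is an assumption.
    """
    if len(pairs) < 2:
        raise ValueError("need at least 2 (frame, landmark) pairs, i.e. K >= 1")
    rng = rng or random.Random()
    t = rng.randrange(len(pairs))            # index of the held-out frame t
    target = pairs[t]                        # (x_t, y_t), used for the Generator
    embed = pairs[:t] + pairs[t + 1:]        # remaining K pairs, used for the Embedder
    return embed, target


# Usage: K = 8, so 9 pairs are sampled from one video.
pairs = [(f"x{i}", f"y{i}") for i in range(9)]
embed_pairs, (x_t, y_t) = split_frames(pairs, random.Random(0))
```

Stacking the K `embed_pairs` over a batch of B videos would then give the [B * K, 2, C, W, H] Embedder input listed above.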
e_hat = E(x, y)
x_hat = G(y_t, e_hat)
r_x_hat = D(x_hat, y_t, i)
r_x = D(x_t, y_t, i)
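
The four lines above describe one forward pass. A minimal sketch of the data flow, with E, G, and D passed in as plain callables, might look like this. Averaging the K per-frame embeddings into a single `e_hat` is an assumption (the notes only write `e_hat = E(x, y)`), and `forward_step` is a hypothetical name.

```python
def forward_step(E, G, D, frames, landmarks, x_t, y_t, i):
    """Run Embedder, Generator, and Discriminator once for video index i.

    E(x, y) is assumed to return an embedding as a list of floats.
    """
    if not frames or len(frames) != len(landmarks):
        raise ValueError("frames and landmarks must be non-empty and equal length")
    per_frame = [E(x, y) for x, y in zip(frames, landmarks)]
    k = len(per_frame)
    # Assumption: average the K per-frame embeddings elementwise.
    e_hat = [sum(vals) / k for vals in zip(*per_frame)]
    x_hat = G(y_t, e_hat)          # synthesized frame from target landmarks
    r_x_hat = D(x_hat, y_t, i)     # realism score of the synthesized frame
    r_x = D(x_t, y_t, i)           # realism score of the real frame
    return e_hat, x_hat, r_x_hat, r_x


# Usage with toy scalar stand-ins for the three networks.
toy_E = lambda x, y: [x + y, x - y]
toy_G = lambda y_t, e: y_t + sum(e)
toy_D = lambda x, y, i: x - y + i
e_hat, x_hat, r_x_hat, r_x = forward_step(
    toy_E, toy_G, toy_D, frames=[1, 3], landmarks=[1, 1], x_t=5, y_t=2, i=0
)
```

The toy lambdas only show the order in which the pieces are wired together; real networks would operate on image tensors.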
LossEG = Lcnt(content) + Ladv(adversarial) + Lmch(match)
loss_cnt(x_t, x_hat) : difference between the features of x_t and x_hat
loss_adv(r_x_hat) = -r_x_hat.mean() + Lfm(feature matching)
loss_mch(e_hat, W_i)
loss_D(r_x, r_x_hat) = (relu(1+r_x_hat) + relu(1-r_x)).mean()
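
The adversarial and discriminator formulas above can be checked on plain lists of scores. The sketch below is written in pure Python rather than a tensor library, so `relu`, `loss_adv`, and `loss_d` are illustrative stand-ins, and `l_fm` is simply passed in as a precomputed number.

```python
def relu(v):
    """Rectified linear unit on a single float."""
    return max(0.0, v)


def loss_adv(r_x_hat, l_fm=0.0):
    """Generator adversarial term: -mean(r_x_hat) + Lfm."""
    return -sum(r_x_hat) / len(r_x_hat) + l_fm


def loss_d(r_x, r_x_hat):
    """Discriminator hinge loss: mean(relu(1 + r_x_hat) + relu(1 - r_x))."""
    terms = [relu(1 + fake) + relu(1 - real) for real, fake in zip(r_x, r_x_hat)]
    return sum(terms) / len(terms)


# Usage: scores for a batch of two real and two synthesized frames.
real_scores = [2.0, 0.5]
fake_scores = [-2.0, 0.0]
d_loss = loss_d(real_scores, fake_scores)   # first pair contributes 0, second 1.5
g_loss = loss_adv([1.0, 3.0], l_fm=0.5)
```

The hinge form stops penalizing the discriminator once a real score exceeds 1 or a synthesized score drops below -1, which is why the first pair in the usage example contributes nothing.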