Computer Vision, AI

[One-page summary] A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation (CVPR 2022) by Chen et al.



Elune001 2024. 1. 16. 00:52

● Summary: Addresses the scarcity of labeled sign language translation data through progressive pretraining

 

● Approach highlight

  • Improves sign language translation performance by using a pretrained language model
  • Visual encoder built on an S3D backbone
  • V-L (visual-language) mapper for end-to-end training: a simple two-layer fully connected MLP
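
To make the V-L mapper bullet concrete, here is a minimal sketch of a two-layer fully connected mapper that projects per-frame visual features into the embedding space of a language model. It is written in plain Python so it runs without a deep learning framework. The names (`make_linear`, `vl_mapper`), the toy dimensions, and the ReLU activation between the two layers are all assumptions for illustration and are not taken from the paper or its code.

```python
import random


def make_linear(in_dim, out_dim, rng):
    """Return (weights, bias) for a fully connected layer with small random weights."""
    scale = 1.0 / in_dim ** 0.5
    weights = [[rng.uniform(-scale, scale) for _ in range(in_dim)]
               for _ in range(out_dim)]
    bias = [0.0] * out_dim
    return weights, bias


def linear(x, layer):
    """Apply y = W x + b to a single feature vector."""
    weights, bias = layer
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(x):
    """Element-wise ReLU (assumed activation; the summary does not specify one)."""
    return [max(0.0, v) for v in x]


def vl_mapper(frames, layer1, layer2):
    """Map each per-frame visual feature into the language-model embedding space."""
    return [linear(relu(linear(f, layer1)), layer2) for f in frames]


# Hypothetical toy sizes; real visual and text embedding widths are much larger.
VISUAL_DIM, HIDDEN_DIM, TEXT_DIM = 8, 16, 4

rng = random.Random(0)
layer1 = make_linear(VISUAL_DIM, HIDDEN_DIM, rng)
layer2 = make_linear(HIDDEN_DIM, TEXT_DIM, rng)

# Five stand-in feature vectors, as if produced by the visual encoder.
frames = [[rng.random() for _ in range(VISUAL_DIM)] for _ in range(5)]
mapped = vl_mapper(frames, layer1, layer2)
print(len(mapped), len(mapped[0]))  # 5 4
```

The mapper keeps the sequence length unchanged and only changes the feature width, which is what lets the visual encoder output be fed to the language model and trained end to end.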

 

● Main Results

 

● Discussion

  • Can a simple two-layer MLP (the V-L mapper) efficiently represent visual features?