Computer Vision , AI

[One-page summary] A Light Touch Approach to Teaching Transformers Multi-view Geometry(CVPR 2023) by Zisserman et al. ๋ณธ๋ฌธ

Paper_review[short]

[One-page summary] A Light Touch Approach to Teaching Transformers Multi-view Geometry(CVPR 2023) by Zisserman et al.

Elune001 2024. 1. 15. 21:37
โ— Summary: Multi-view geometry improves object retrieval performance
 
โ—Approach highlight
  •  Reranking transformer for object retrieval

  • Epipolar Loss and Max-Epipolar Loss: Using epiploar line to utilize multi-view image

$๐ด^{12}, ๐ด^{21}$: cross attention map from last transformer ๐•(๐‘–,๐‘—): indicator function. if (๐‘–,๐‘—)on epipolar line then 1, else 0

 

โ— Main Results

 

โ— Discussion

  • No significant performance improvements (what is the drawback? Is it a really good idea to use the Epiploar geometry view?)