Paper_review[short]
[One-page summary] A Light Touch Approach to Teaching Transformers Multi-view Geometry (CVPR 2023) by Bhalgat, Henriques, and Zisserman
Elune001
2024. 1. 15. 21:37
Summary: Multi-view geometry improves object retrieval performance
Approach highlight
- Reranking transformer for object retrieval
- Epipolar Loss and Max-Epipolar Loss: use epipolar lines to exploit multi-view images
$A^{12}, A^{21}$: cross-attention maps from the last transformer layer
$\mathbb{1}(i, j)$: indicator function, 1 if $(i, j)$ lies on the epipolar line, 0 otherwise
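To make the epipolar-line indicator concrete, here is a minimal NumPy sketch. It is not the paper's implementation: the function names `epipolar_mask` and `bce_epipolar_loss`, the pixel tolerance `tol`, and the choice of binary cross-entropy as the penalty are all assumptions made for illustration. It assumes a known fundamental matrix `F` with the convention x2^T F x1 = 0, and treats the attention of a single query point as a map over the second image.

```python
import numpy as np

def epipolar_mask(F, x1, height, width, tol=1.0):
    """Indicator map over image 2 (hypothetical helper): 1 where a pixel lies
    within `tol` pixels of the epipolar line of point x1 = (u, v) in image 1."""
    # Epipolar line in image 2: l = F @ x1, with coefficients (a, b, c) of a*u + b*v + c = 0.
    line = F @ np.array([x1[0], x1[1], 1.0])
    us, vs = np.meshgrid(np.arange(width), np.arange(height))  # both (height, width)
    # Point-to-line distance; assumes a non-degenerate line, i.e. (a, b) != (0, 0).
    dist = np.abs(line[0] * us + line[1] * vs + line[2]) / np.hypot(line[0], line[1])
    return (dist <= tol).astype(np.float64)

def bce_epipolar_loss(attn, mask, eps=1e-7):
    """Assumed loss form: binary cross-entropy between one query's attention
    map (values in [0, 1]) and the epipolar indicator map."""
    attn = np.clip(attn, eps, 1.0 - eps)  # avoid log(0)
    return float(-np.mean(mask * np.log(attn) + (1.0 - mask) * np.log(1.0 - attn)))

# Toy case: rectified stereo pair, where the epipolar line of (u, v) is row v.
F = np.array([[0.0, 0.0, 0.0],
              [0.0, 0.0, -1.0],
              [0.0, 1.0, 0.0]])
mask = epipolar_mask(F, x1=(2, 1), height=4, width=5, tol=0.5)
uniform_attn = np.full((4, 5), 0.5)
print(mask)
print(bce_epipolar_loss(uniform_attn, mask))
```

In the toy case the mask is 1 only on the row that shares the query point's vertical coordinate, so attention concentrated on that row scores a lower loss than attention spread uniformly over the image.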
Main Results
Discussion
- No significant performance improvement (what is the drawback? Is using epipolar geometry really a good idea?)