# Img. | Object Label: Training viewpoint | Object Label: Smart viewpoint | Object Label: Random viewpoint | LLaVA Caption: Training viewpoint | LLaVA Caption: Smart viewpoint | LLaVA Caption: Random viewpoint |
---|---|---|---|---|---|---|
1 | 57.89 | 30.08 (-27.81) | 17.29 (-40.60) | 45.12 | 32.97 (-12.15) | 15.28 (-29.84) |
5 | 68.42 | 64.66 (-3.76) | 18.80 (-49.62) | 55.07 | 57.17 (+2.10) | 23.58 (-31.49) |
10 | 78.95 | 67.67 (-11.28) | 24.81 (-54.14) | 67.07 | 61.23 (-5.84) | 28.92 (-38.15) |
20 | 83.46 | 73.68 (-9.78) | 34.59 (-48.87) | 72.41 | 63.25 (-9.16) | 33.01 (-39.40) |
50 | 84.21 | 77.69 (-6.52) | 49.62 (-34.59) | 71.80 | 65.82 (-5.98) | 35.58 (-36.22) |
100 | 84.95 | 80.02 (-4.93) | 57.89 (-27.06) | 73.67 | 70.48 (-3.19) | 43.04 (-30.63) |
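
The parenthesized values in the Smart and Random columns appear to be differences from the Training column of the same group (for example, 64.66 - 68.42 = -3.76). The following is a minimal Python sketch, assuming that reading, which recomputes the differences for one row copied from the table. The names `ROW` and `check_row` are illustrative and do not come from the original work.

```python
import re

# One row copied verbatim from the table above (5 images).
ROW = "5 | 68.42 | 64.66 (-3.76) | 18.80 (-49.62) | 55.07 | 57.17 (+2.10) | 23.58 (-31.49) |"

# A cell is either "score" or "score (signed difference)".
CELL = re.compile(r"([\d.]+)(?:\s*\(([+-][\d.]+)\))?")


def check_row(row):
    """Return (score, reported, computed, matches) for every cell with a difference.

    Assumption: a cell without parentheses is a Training score and acts as
    the baseline for the cells that follow it.
    """
    cells = [c.strip() for c in row.split("|") if c.strip()]
    results = []
    baseline = None
    for cell in cells[1:]:  # cells[0] is the number of images
        match = CELL.fullmatch(cell)
        score = float(match.group(1))
        if match.group(2) is None:
            baseline = score
            continue
        reported = float(match.group(2))
        computed = round(score - baseline, 2)
        results.append((score, reported, computed, abs(computed - reported) < 1e-9))
    return results


for score, reported, computed, matches in check_row(ROW):
    print(f"score={score:.2f} reported={reported:+.2f} computed={computed:+.2f} ok={matches}")
```

Running the same check over the other rows is a quick way to catch transcription errors when the table is copied elsewhere.
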

# Img. | Training: %Grid | Training: %Grid w/θ | Smart: %Grid | Smart: %Grid w/θ | Smart: %Train | Smart: %Train w/θ | Random: %Grid | Random: %Grid w/θ | Random: %Train | Random: %Train w/θ |
---|---|---|---|---|---|---|---|---|---|---|
0 | 9.9 | 9.9 | 9.9 | 9.9 | 0.0 | 0.0 | 9.9 | 9.9 | 0.0 | 0.0 |
1 | 13.0 | 11.4 | 15.3 | 12.7 | 11.3 | 8.3 | 16.2 | 14.3 | 12.7 | 9.3 |
5 | 14.3 | 12.2 | 32.8 | 16.5 | 42.7 | 23.6 | 35.6 | 19.6 | 29.5 | 14.4 |
10 | 17.2 | 13.9 | 44.2 | 24.9 | 65.0 | 46.0 | 53.7 | 32.4 | 49.4 | 24.5 |
20 | 20.6 | 15.8 | 56.4 | 29.8 | 83.3 | 58.1 | 61.7 | 41.5 | 64.7 | 39.2 |
50 | 41.6 | 17.5 | 71.1 | 46.9 | 87.5 | 69.3 | 78.2 | 54.7 | 73.7 | 65.0 |
100 | 46.7 | 30.7 | 79.3 | 53.3 | 87.9 | 79.4 | 83.6 | 61.0 | 85.6 | 78.6 |

Stage | Action or Storage | SplatFacto (Ours) | Nerfacto (Ours) | LangSplat (Baseline) | LERF (Baseline) |
---|---|---|---|---|---|
NGR Training | Train Time (min) | 6.82 | 8.24 | 90.5 | 40.1 |
NGR Training | Model Size (MB) | 478.49 | 176.02 | 958.67 | 1282.4 |
NGR Analysis | Generate Visual Emb. (s) | 17.25 | 19.23 | 152.5 | 53.6 |
Database & Retrieval | Embedding Size | 20.58 MB | 20.58 MB | 18.78 GB | 225.4 GB |
Database & Retrieval | Retrieval Time (s) | 5e-5 | 5e-5 | 1e-3 | 17 |
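
To make the gap between columns easier to read, the embedding sizes and retrieval times can be expressed as ratios relative to the SplatFacto column. This is a minimal Python sketch using only the values from the table. It assumes decimal units (1 GB = 1000 MB), which the source does not specify, and the helper names `to_mb` and `ratio_vs` are illustrative.

```python
# Values copied from the table above.
EMBEDDING_SIZE = {
    "SplatFacto": "20.58MB",
    "Nerfacto": "20.58MB",
    "LangSplat": "18.78GB",
    "LERF": "225.4GB",
}
RETRIEVAL_TIME_S = {
    "SplatFacto": 5e-5,
    "Nerfacto": 5e-5,
    "LangSplat": 1e-3,
    "LERF": 17.0,
}

# Assumption: decimal units. With binary units the ratios grow by about 2.4%.
UNIT_TO_MB = {"MB": 1.0, "GB": 1000.0}


def to_mb(size):
    """Convert a string such as '18.78GB' to a number of megabytes."""
    value, unit = size[:-2], size[-2:]
    return float(value) * UNIT_TO_MB[unit]


def ratio_vs(reference, table, convert=float):
    """Divide every entry of `table` by the entry named `reference`."""
    ref = convert(table[reference])
    return {name: convert(value) / ref for name, value in table.items()}


size_ratio = ratio_vs("SplatFacto", EMBEDDING_SIZE, to_mb)
time_ratio = ratio_vs("SplatFacto", RETRIEVAL_TIME_S)

for name in EMBEDDING_SIZE:
    print(f"{name}: {size_ratio[name]:.0f}x embedding size, "
          f"{time_ratio[name]:.0f}x retrieval time")
```

Under the decimal-unit assumption, the LangSplat and LERF embeddings come out roughly three and four orders of magnitude larger than the 20.58 MB reported for SplatFacto and Nerfacto.
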

# Img. | Training viewpoint | Smart viewpoint | Random viewpoint |
---|---|---|---|
10 | 41.62 | 39.63 (-1.99) | 18.39 (-23.23) |
20 | 50.06 | 47.34 (-2.72) | 23.58 (-26.48) |
50 | 58.23 | 54.39 (-3.84) | 27.93 (-30.30) |
100 | 65.03 | 64.18 (-0.85) | 29.04 (-35.99) |

[Figure: for each of Step 1 and Step 2, three panels: RGB, Clean Score, and Next Viewpoint Selection.]

@inproceedings{guan2025retri3d,
title={Retri3D: 3D Neural Graphics Representation Retrieval},
author={Yushi Guan and Daniel Kwan and Jean Sebastien Dandurand and Xi Yan and Ruofan Liang and Yuxuan Zhang and Nilesh Jain and Nilesh Ahuja and Selvakumar Panneer and Nandita Vijaykumar},
booktitle={International Conference on Learning Representations},
year={2025}
}