Holi-Spatial
Evolving Video Streams into Holistic 3D Spatial Intelligence
Unified Video Stream Modeling
Integrates continuous egocentric clips into coherent spatial trajectories for holistic scene understanding.
Grounding + Depth Coupling
Jointly analyzes object grounding and depth estimation to expose cross-view consistency and failure cases.
Benchmark-Driven Evaluation
Connects visual evidence to downstream spatial QA, enabling robust and interpretable model comparison.
Geometric Visualization
Visualize scene geometry through synchronized 3D grounding and depth estimation. Swipe or click dots to explore different scenes.
Swipe or click dots to switch · Videos play in sync
3D Visualization
Mesh Comparison
Compare lightweight base-plus-delta scene geometry across methods. Drag to rotate, scroll to zoom.
Spatial QA
Spatial QA Demonstration
Comprehensive spatial intelligence QA data covering camera motion, object relations, and camera-object interactions — capturing diverse spatial reasoning tasks across real indoor scenes.