VD-GR: Boosting Visual Dialog with Cascaded Spatial-Temporal Multi-Modal GRaphs

Published in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024