Abstract: Building change detection (CD) from bitemporal remote sensing images aims to identify newly constructed, demolished, or modified buildings by comparing observations acquired at different ...
Abstract: Multimodal language models (MLMs) still face challenges in fundamental visual perception tasks where specialized models excel. Tasks requiring reasoning about 3D structures benefit from ...