Media Summary: MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention. Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement. [CVPR 2026] Geometry-Guided 3D Visual Token Pruning for Video-Language Models
Cvpr 2026 Gaussianzoom Video - Detailed Analysis & Overview
MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention. Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement. [CVPR 2026] Geometry-Guided 3D Visual Token Pruning for Video-Language Models We propose SmokeSVD, a diffusion-based framework that progressively reconstructs dynamic smoke from a single