Media Summary: "FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution ... [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation ProcessMaker: A Generalized Process Visualization Framework with Adaptive Sequence Steps on Diffusion Transformers.

Cvpr 24 Oral Metacloak - Detailed Analysis & Overview

"FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution ... [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation ProcessMaker: A Generalized Process Visualization Framework with Adaptive Sequence Steps on Diffusion Transformers. Bi-level Learning of Task-Specific Decoders for Joint Registration and One-Shot Medical Image Segmentation. A Recurrent Vision-and-Language BERT for Navigation Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen ... We address the generalization ability of recent learning-based point cloud registration methods. Despite their success, these ...

Our paper on directly optimizing rank-based metrics (called RaMBO) using our method. We will present it at ... Please visit the project page for more information: We present GROUNDHOG, a multimodal large language model capable of pixel-level language grounding to a wide range of ... CVPR26 Poster: Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress.

Photo Gallery

[CVPR'24 Oral] MetaCloak
[CVPR-24 Oral] Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration
[CVPR 2024 Oral] FMA-Net: (...) for Joint Video Super-Resolution and Deblurring
[CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation
[CVPR 2024] Constructing and Exploring Intermediate Domains in MiDSS
CVPR 2024 MemFlow
[CVPR 2024 Highlight] Diversified and Personalized Multi-rater Medical Image Segmentation
[CVPR 2026] ProcessMaker
Video for CVPR 2024 Paper
CVPR 2021 Oral: A Recurrent Vision and Language BERT for Navigation
CVPR 2026 (Oral) - Understanding Task Transfer in Vision-Language Models
[CVPR 2021, oral] PointNetLK Revisited
View Detailed Profile
[CVPR'24 Oral] MetaCloak

[CVPR'24 Oral] MetaCloak

This is the video presentation of

[CVPR-24 Oral] Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration

[CVPR-24 Oral] Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration

Oral

[CVPR 2024 Oral] FMA-Net: (...) for Joint Video Super-Resolution and Deblurring

[CVPR 2024 Oral] FMA-Net: (...) for Joint Video Super-Resolution and Deblurring

"FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution ...

[CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation

[CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation

[CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation

[CVPR 2024] Constructing and Exploring Intermediate Domains in MiDSS

[CVPR 2024] Constructing and Exploring Intermediate Domains in MiDSS

[

CVPR 2024 MemFlow

CVPR 2024 MemFlow

CVPR 2024 MemFlow

[CVPR 2024 Highlight] Diversified and Personalized Multi-rater Medical Image Segmentation

[CVPR 2024 Highlight] Diversified and Personalized Multi-rater Medical Image Segmentation

CVPR

[CVPR 2026] ProcessMaker

[CVPR 2026] ProcessMaker

ProcessMaker: A Generalized Process Visualization Framework with Adaptive Sequence Steps on Diffusion Transformers.

Video for CVPR 2024 Paper

Video for CVPR 2024 Paper

Bi-level Learning of Task-Specific Decoders for Joint Registration and One-Shot Medical Image Segmentation.

CVPR 2021 Oral: A Recurrent Vision and Language BERT for Navigation

CVPR 2021 Oral: A Recurrent Vision and Language BERT for Navigation

A Recurrent Vision-and-Language BERT for Navigation Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen ...

CVPR 2026 (Oral) - Understanding Task Transfer in Vision-Language Models

CVPR 2026 (Oral) - Understanding Task Transfer in Vision-Language Models

https://aka.ms/task-transfer-vlms.

[CVPR 2021, oral] PointNetLK Revisited

[CVPR 2021, oral] PointNetLK Revisited

We address the generalization ability of recent learning-based point cloud registration methods. Despite their success, these ...

[CVPR 2026] CarlaOcc

[CVPR 2026] CarlaOcc

CVPR

Optimizing Rank-based Metrics with Blackbox Differentiation (CVPR 2020) 5 minute oral

Optimizing Rank-based Metrics with Blackbox Differentiation (CVPR 2020) 5 minute oral

Our paper on directly optimizing rank-based metrics (called RaMBO) using our #blackboxbackprop method. We will present it at ...

[CVPR 2024] FlowVQTalker

[CVPR 2024] FlowVQTalker

[

[CVPR 2020 Oral] Quick Introduction to Deep Global Registration

[CVPR 2020 Oral] Quick Introduction to Deep Global Registration

Please visit the project page for more information: https://chrischoy.github.io/publication/dgr/

[CVPR 2024] GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

[CVPR 2024] GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

We present GROUNDHOG, a multimodal large language model capable of pixel-level language grounding to a wide range of ...

CVPR '26 | R2VLM

CVPR '26 | R2VLM

CVPR26 Poster: Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress.