SPATIALUNCERTAIN: Testing VLM Spatial Limits - Detailed Analysis & Overview

SPATIALUNCERTAIN: Testing VLM Spatial Limits

In this AI Research Roundup episode, Alex discusses the paper: 'Seeing Isn't Knowing: Do VLMs Know When Not to Answer ...

SpatialReasoner-R1: VLM Spatial Logic

In this AI Research Roundup episode, Alex discusses the paper: 'Fine-Grained Preference Optimization Improves

What Are Vision Language Models? How AI Sees & Understands Images

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas (ICML 2025)

Presentation date: 2025.09.04 Presenter: 장재윤 Title: Why Is

SpatialTunnel: Probing 3D Spatial Bias in VLMs

In this AI Research Roundup episode, Alex discusses the paper: 'Why Far Looks Up: Probing

Open-Weight VLMs: The 5-Axis Recipe for Building Better Vision-Language Models

Open-weight Vision-Language Models are becoming more powerful, but the winning recipe is not always about adding complex ...

Improving Vision-and-Language Reasoning via Spatial Relations Modeling

Authors: Cheng Yang; Rui Xu; Ye Guo; Peixiang Huang; Yiru Chen; Wenkui Ding; Zhongyuan Wang; Hong Zhou Description: ...

MindJourney: Test-Time Scaling with World Models for Spatial Reasoning

The video introduces MindJourney, a framework that enhances Vision-Language Models (VLMs), which excel at interpreting ...

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities [Jihun Lee]

MLV Group Seminar (24.09.09) [Paper] SpatialVLM: Endowing Vision-Language Models with

LBW131: Exploring Spatial Scale Perception in Immersive Virtual Reality for Risk Assessment in ...

LBW131: Exploring

Jonathan Wai and David Uttal: Why spatial reasoning matters for education policy | LIVE STREAM

Most K–12

Vision Language Models (VLMs) Explained: The AI That Can Truly See!

Imagine showing an AI a picture of your messy room and asking it to help you organize it—or uploading a medical scan and ...

Spatial VLM presentation, CVPR 2024

Presentation video for our paper, SpatialVLM: Endowing Vision-Language Models with

Spatial Abilities: An Overlooked Essential Skill | Sania Bidurukontam | TEDxDVHS

Sania Bidurukontam will be speaking about