Spatial Reasoning in Multimodal LLMs via CoT Distillation and Monte Carlo Tree Search for Dutch Facade-Element Detection
Exploring agentic spatial reasoning in state-of-the-art Multimodal LLMs for Dutch building renovation, and introducing DuTCh SpaCE to strengthen step-by-step reasoning and self-correction and to reduce hallucinations.