LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
Abstract
We propose a training-free method for open-vocabulary semantic segmentation using Vision-and-Language Models (VLMs). Our approach enhances the initial per-patch predictions of VLMs through label propagation, which jointly optimizes predictions by incorporating patch-to-patch relationships. Since VLMs are primarily optimized for cross-modal alignment and not for intra-modal similarity, we use a Vision Model (VM) that is observed to better capture these relationships. We address resolution limitations inherent to patch-based encoders by applying label propagation at the pixel level as a refinement step, significantly improving segmentation accuracy near class boundaries. Our method, called LPOSS+, performs inference over the entire image, avoiding window-based processing and thereby capturing contextual interactions across the full image. LPOSS+ achieves state-of-the-art performance among training-free methods, across a diverse set of datasets. Code: https://github.com/vladan-stojnic/LPOSS
Community
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- The Power of One: A Single Example is All it Takes for Segmentation in VLMs (2025)
- Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation (2025)
- LangDA: Building Context-Awareness via Language for Domain Adaptive Semantic Segmentation (2025)
- Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation (2025)
- Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models (2025)
- DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation (2025)
- Disentangling CLIP Features for Enhanced Localized Understanding (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 1
Collections including this paper 0
No Collection including this paper