ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning
Computing Research Repository (CoRR)(2024)
NAVER Cloud KAIST | NAVER | Korea Advanced Institute of Science and Technology
The authors of this paper include Beomyoung Kim, who is affiliated with the Korea Advanced Institute of Science and Technology (KAIST) and whose research areas encompass continuous learning, visual prompt tuning, panoramic segmentation, image classification, and ImageNet. The second author, Joonsang Yu, focuses on neural network architectures, visual applications and systems, efficient learning and inference, visual prompt tuning, and continuous learning. The third author, Sung Ju Hwang, holds a position at DeepAuto.ai and specializes in machine learning, visual recognition, natural language understanding, healthcare, as well as meta-learning, semi-supervised learning, representation learning, transfer learning, and unsupervised learning.
Outline of the Paper
1. Abstract
- Panoptic segmentation combines semantic segmentation and instance segmentation.
- Continual learning enables the model to adapt to new classes, avoiding catastrophic forgetting.
- Propose the ECLIPSE method based on visual prompt tuning, reducing trainable parameters and simplifying the continual learning process.
2. Related Work
- The development history of panoptic segmentation.
- Challenges and existing methods in continual segmentation.
- Application of Visual Prompt Tuning in continual learning.
3. Problem Setting
- Definition and challenges of panoptic segmentation.
- Settings of continual learning in panoptic segmentation.
4. Method
4.1 Visual Prompt Tuning for Continual Segmentation
- Overview of the ECLIPSE method.
- Freeze the parameters of the base model, fine-tune the prompt embeddings.
- Deep and shallow prompt tuning strategies.
4.2 Addressing Semantic Confusion and Drift
- Issues of error propagation and semantic drift.
- Propose logit manipulation strategy.
5. Experimental Setup
- Datasets and evaluation metrics.
- Incremental protocol.
- Implementation details.
6. Experimental Results
6.1 Continual Panoptic Segmentation
- Comparison with existing methods.
- Performance in different scenarios.
6.2 Continual Semantic Segmentation
- Comparison with existing methods.
- Performance in different scenarios.
7. Analysis
- Impact of the number of prompts, computational complexity, visual prompt tuning, and logit manipulation.
8. Conclusion and Future Directions
- Summarize the contributions of the ECLIPSE method.
- Propose potential future optimization directions.
Appendix
A.3 Validation of Continual Panoptic Segmentation on the COCO Dataset
- Experimental setup and results.
A.4 Exploring Pre-trained Knowledge
- Impact of using different pre-trained weights.
Q: What research methods were specifically used in the paper?
- Problem Setup: The paper focuses on the problem of continuous learning in panoptic segmentation, i.e., how to avoid catastrophic forgetting of old classes while effectively integrating new knowledge (plasticity) during the continuous learning of new classes.
- Network Architecture: The Mask2Former architecture is used as the benchmark, which is a transformer-based model capable of handling panoptic, instance, and semantic segmentation tasks.
- Visual Prompt Tuning (VPT): A new continuous learning method is proposed, which freezes the parameters of the base model and fine-tunes a set of new prompt embeddings to identify new classes.
- Logit Manipulation: To address the issues of semantic confusion and drift, a simple logit manipulation strategy is proposed, which adjusts the logit values of the no-obj class to help suppress incorrect predictions and mitigate semantic drift.
Q: What are the main research findings and achievements?
- Research Achievements: The ECLIPSE method achieved new state-of-the-art performance on the ADE20K continuous panoptic segmentation benchmark with only 1.3% trainable parameters.
- Catastrophic Forgetting and Plasticity: The ECLIPSE method effectively addresses the problem of catastrophic forgetting and demonstrates prominent plasticity for new classes, especially as the number of categories increases.
- Computational Efficiency: Compared to methods relying on knowledge distillation and pseudo-labels, the ECLIPSE method simplifies the continuous learning process and significantly reduces training computational load.
Q: What are the current limitations of this research?
- Computational Complexity: As the number of categories increases, the expansion of the prompt set in the ECLIPSE method may lead to increased computational complexity.
- Hyperparameter Tuning in Logit Manipulation: The hyperparameters involved in the logit manipulation (such as δ) need to be carefully tuned to achieve optimal performance.
- Scalability: Although the ECLIPSE method performs well in current tests, optimizing scalability when dealing with large-scale category sets remains a challenge.
- Utilization of Pre-trained Knowledge: While the paper explores the impact of using different pre-trained weights, how to more effectively utilize this pre-trained knowledge to further improve performance still needs further research.

A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application
被引用1