`Eyes of a Hawk and Ears of a Fox': Part Prototype Network for Generalized Zero-Shot Learning
Computer Vision and Pattern Recognition(2024)
Abstract
Current approaches in Generalized Zero-Shot Learning (GZSL) are built uponbase models which consider only a single class attribute vector representationover the entire image. This is an oversimplification of the process of novelcategory recognition, where different regions of the image may have propertiesfrom different seen classes and thus have different predominant attributes.With this in mind, we take a fundamentally different approach: a pre-trainedVision-Language detector (VINVL) sensitive to attribute information is employedto efficiently obtain region features. A learned function maps the regionfeatures to region-specific attribute attention used to construct class partprototypes. We conduct experiments on a popular GZSL benchmark consisting ofthe CUB, SUN, and AWA2 datasets where our proposed Part Prototype Network (PPN)achieves promising results when compared with other popular base models.Corresponding ablation studies and analysis show that our approach is highlypractical and has a distinct advantage over global attribute attention whenlocalized proposals are available.
MoreTranslated text
Key words
zero-shot learning,image segmentation,object detection,pre-trained models,few-shot learning
PDF
View via Publisher
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper
Summary is being generated by the instructions you defined